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Patent Examining Operations 

Applicant(s): Van den Bos and Mbalaviele 

Serial No: PCT/US00/08751 Art Unit: Unassigned 

Filed: 31 March 2000 Examiner: Unassigned 

Title: HUMAN MESENCHYMAL DNAs AND EXPRESSION PRODUCTS 

Docket No: 640100-430 



Commissioner for Patents 
Washington, D.C. 20231 



Preliminary Amendment 

Sir: 

The above-referenced PCT application is filed herewith as a National-Stage 
application under 35 U.S.C. 371. Prior to examination on the merits, please enter the 
following amendment. 

In the Specification: 

Please amend the paragraph starting on page 1 , line 8, to read as follows: 

"This application is a national stage filing based on PCT/US00/08751 and claims 
the priority of U.S. Provisional Applications 60/148,800, filed 13 August 1999, and 
60/127,418, filed 1 April 1999, the disclosures of which are hereby incorporated by 
reference in their entirety." 



REMARKS 

The application filed herewith is a national stage filing under 35 U.S.C. 371 and the 
specification has been amended to reflect this. 



The Commissioner is authorized to charge payment of any additional filing fees 
required under 37 CFR 1.16 associated with this communication or credit any 
overpayment to Deposit Account No. 03-0678. 



i EXPRESS MAIL CERTIFICATE 

Express Mail Label No. EL680647645US 

.ft 

Deposit Date: 1 October 2001 

I hlreby certify that this paper and the 
attachments hereto are being deposited today with 
the U.S ftostal Service "Express Mail Post Office To 
Addressee" service under 37 CFR 1 .10 on the date 
indicated above addressed to: 



f ^Commissioner for Patents 
Washington, DC 20231 



Alan J. Grant, Esq. 



<>ltfo( 



Respectfully submitted, 

Alan J. Grant, Esq. 

Reg. No. 33,389 

CARELLA, BYRNE, BAIN, GILFILLAN, 

CECCHI, STEWART & OLSTEIN 

6 Becker Farm Road 
Roseland, NJ 07068 
Tel. No.: (973) 994-1700 



AMENDED SPECIFICATION 

The paragraph starting on page 1 , line 8, has been amended as follows: 

"This application is a national stage filing based on PCT/US00/08751 and claims 
the priority of U.S. Provisional Applications 60/148,800, filed 13 August 1999, and 
60/127,418, filed 1 April 1999, the disclosures of which are hereby incorporated by 
reference in their entirety." 
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HUMAN MESENCHYMAL DNAs 
AND EXPRESSION PRODUCTS 

5 

This application ciaims the priority of U.S. Provisional 
Applications 60/148,800, filed 13 August 1999, and 60/127,418, filed 1 
10 April 1999, the disclosures of which are hereby incorporated by reference 
in their entirety. 



15 BACKGROUND OF THE INVENTION 

This invention relates to newly identified polynucleotide 
sequences corresponding to transcription products of human genes, and 
to complete gene sequences associated therewith and to gene expression 

2 0 products thereof and to uses for the foregoing. 

Osteoblasts, key cells in bone formation, or osteogenesis, are 
formed from mesenchymal stem cells. Such mesenchymal stem cells {or 
MSCs) of numerous mammalian species can be induced to differentiate 
25 into connective tissue cell lineages by varying the in vitro culture 
conditions. Osteogenesis, the differentiation into bone cells, has been 
reported as a means to generate replacement bone from cultured and 
implanted MSCs (Bruder et al. Growth Kinetics, Self-Renewal, and the 
Osteogenic Potential of Purified Human Mesenchymal Stem Celis During 

3 0 Extensive Subcultivation and Following Cryopreservation, J. Cell 

Biochem. , 64(2):278-294 {Feb. 1997); Jaiswal et al., Osteogenic 
Differentiation of Purified, Culture-Expanded Human Mesenchymal Stem 
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Cells In Vitro, J. Ceil Biochem. , 64(2):295-31 2 (Feb. 1997), Kadiyala et 
al., Culture Expanded Canine Mesenchymal Stem Cells Possess 
Osteochondrogenic Potential In Vivo and In Vitro, Cell Transplant , 
6(2):125-134 (Mar-Apr 1997)). 

5 

The process by which MSCs undergo osteogenic 
differentiation in culture is marked by the development of an osteoblastic 
morphology, the deposition of a hydroxyapetite mineralized extracellular 
matrix characteristic of osteoblasts and the presence of terminally 

10 differentiated osteocytes, as well as the expression of alkaline 
phosphatase (Jaiswal et al., Osteogenic Differentiation of Purified, 
Culture-Expanded Human Mesenchymal Stem Cells In Vitro, J. Cell 
Biochem. , 64(2):295-31 2 (Feb. 1997)). Mechanisms underlying the 
osteogenic differentiation of human MSCs (hereafter, hMSCs) are poorly 

15 understood. Identification of proteins produced during this process would 
greatly facilitate the discovery and development of small molecules that 
target the osteoblast and its bone forming potential. Identification of 
these factors would be accelerated by the availability of relevant cDNA 
libraries constructed from hMSCs during various stages of their 

20 differentiation. 

Identification and sequencing of human genes is a major goal 
of modern Molecular Biology. For example, by identifying genes and 
determining their sequences, scientists have been able to make large 

25 quantities of valuable human "gene products." These include human 
insulin, interferon, Factor VIII, tumor necrosis factor, human growth 
hormone, tissue plasminogen activator, and numerous other compounds. 
Additionally, knowledge of gene sequences can provide the key to 
treatment or cure of genetic diseases (such as muscular dystrophy and 

3 0 cystic fibrosis). 
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BRIEF SUMMARY OF THE INVENTION 

5 

In accordance with the present invention, Mesenchymal stem 
cells (MSCs) have been isolated and culture expanded from humans, and 
from them new cDNA libraries have been constructed from messenger 
ribonucleic acids (hereafter, mRNAs) isolated from hMSCs. 

10 

It is an object of the present invention to obtain cDNA 
libraries from purified and cultured MSCs and to use these isolated nucleic 
acids, isolated sequences, and fragments thereof, in the determination 
and preparation of the expression products of these nucleic acids and 
15 sequences, including fragments thereof. 

tt is a further object of the present invention to use the 
cDNAs so produced, and fragments thereof, as well as their expression 
products, as chromosomal markers for determining the location of genes 
20 within the genome, and alleles thereof, expressed during the development 
of differentiated mesenchymal cells. 

It is yet another object of the present invention to provide 
DNA sequences for use in human "fingerprinting" whereby different 
25 individuals can be distinguished based on the sequences of the genes 
identified as wholly, or partly, identical to those disclosed herein. 

It is still another object of the present invention to provide 
polynucleotide sequences corresponding to the genes coding for 
3 0 polypeptides as disclosed herein whereby such sequences can be 
compared with those found in similar chromosomal locations in animals, 
especially mammals, and most especially humans, where such animal is 
3 
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affiicted with a disease affecting bone growth, or such other disease, or 
diseases, as may be affected by such genes, and thus detecting the 
presence of mutations in said genes leading to such diseases. 

5 It is a still further object of the present invention to provide 

genetically engineered cells, and vectors, containing one or more copies 
of the nucleic acids, or DNAs, or genes, or nucleotide sequences 
according to the present invention, capable of expressing said peptides, 
or polypeptides, or proteins for rapid cloning of genes according to the 
10 present invention. 



BRIEF DESCRIPTION OF THE DRAWINGS 

15 

Figure 1 shows the consensus sequence (SEQ ID NO: 27) for 
the novel DNA sequence of the invention as determined from different 
cDNA clones of said sequence, the latter being about 2.5 kb in length. 



20 Figure 2 is a deduced amino acid sequence for the protein 

expressed from the sequence of Figure 1, residues 125 through 1717 and 
corresponding to SEQ ID NO:29. The amino acids set off between 
asterisks constitute a bipartite nuclear localization signal. The isoelectric 
point and molecular weight were also calculated for the putative protein. 

25 

Figure 3 shows the results of a dot blot assay for the 
presence of the novel DNA sequence in a variety of human tissues. For 
this assay, a prefabricated dot blot from Clontech {#7770-1) was 
hybridized using a probe generated from the 2.5 kb cDNA of Figure 1 and 
3 0 treated according to the manufacturer's instructions. Signals due to 
bound probe were analyzed using a Storm 860 phosphorimager and 
imagequant software. 
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Figure 4 is a bar graph showing the distribution of the 
sequence of Figure 1 in a variety of human tissues based on relative 
mRNA abundance. The highest signal strength was in cells of adult heart 
and lowest was in fetal thymus. The bar graphs were generated using 
5 data from the dot blots of Figure 3 and were imported into an Excel 
spreadsheet. The data were then analyzed as arbitrary signal strength per 
tissue after subtracting background (due to non-specific hybridization). 
The order of the tissues in the bar graph reflects signal strength (and 
therefor differs from that on the dot biot of Figure 3). Figure 4(b) is a 
10 continuation of Figure 4(a). 



15 DETAILED DESCRIPTION OF THE INVENTION 

One aspect of the present invention is directed to nucleic acids and 
isolated DNA sequences and molecules, and fragments thereof (and 
corresponding isolated RNA sequences, and fragments thereof), including 

2 0 sequences complementary to the foregoing, showing sequence similarity 

to, or capable of hybridizing to, the DNA sequences identified in SEQ ID 
NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, and 27 or 28. The 
present invention is also directed to fragments or portions of such 
sequences which contain at least 1 5 bases, preferably at least 30 bases, 
25 more preferably at least 50 bases and most preferably at least 80 bases, 
and to those sequences which are at least 60%, preferably at least 80%, 
and most preferably at least 95%, especially 98%, identical thereto, and 
to DNA (or RNA) sequences encoding the polypeptides of SEQ ID NOS: 2, 
4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, and 29, including fragments 

3 0 and portions thereof and, when derived from natural sources, includes 

alleles thereof. 
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In accordance with the present invention, the term "percent 
identity" or "percent identical," when referring to a sequence, means that a 
sequence is compared to a claimed or described sequence after alignment 
of the sequence to be compared (the "Compared Sequence") with the 
5 described or claimed sequence (the "Reference Sequence"}. The Percent 
Identity is then determined according to the following formula: 

Percent identity = 100 [1-(C/R)] 

10 

wherein C is the number of differences between the Reference Sequence 
and the Compared Sequence over the iength of alignment between the 
Reference Sequence and the Compared Sequence wherein (i) each base or 
amino acid in the Reference Sequence that does not have a corresponding 

15 aligned base or amino acid in the Compared Sequence and (ii) each gap in 
the Reference Sequence and (iii) each aligned base or amino acid in the 
Reference Sequence that is different from an aligned base or amino acid in 
the Compared Sequence, constitutes a difference; and R is the number of 
bases or amino acids in the Reference Sequence over the length of the 

20 alignment with the Compared Sequence with any gap created in the 
Reference Sequence also being counted as a base or amino acid. 

If an alignment exists between the Compared Sequence and 
the Reference Sequence in which the percent identity as calculated above is 
25 about equal to or greater than a specified minimum Percent identity then the 
Compared Sequence has the specified minimum percent identity to the 
Reference Sequence even though alignments may exist in which the 
hereinabove calculated Percent Identity is less than the specified Percent 
identity. 

30 

Yet another aspect of the present invention is directed to an 
isolated DNA (or RNA) sequence or molecule comprising at least the 

6 
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coding region of a human gene (or a DNA sequence encoding the same 
polypeptide as such coding region), in particular an expressed human 
gene, which human gene comprises a DNA sequence homologous with, 
or contributing to, the sequence depicted in SEQ ID NOS: 1, 3, 5, 7, 9, 
5 1 1, 13, 15, 17, 19, 21, 23, 25 and 27 or 28, or one at least 60%, 
preferably at least 80%, and most preferably at least 95%, especially 
98%, identical thereto, including 100% identity, as well as fragments or 
portions of the coding region which encode a polypeptide having a similar 
function to the polypeptide encoded by said coding region. Thus, the 
10 isolated DNA (or RNA) sequence may include only the coding region of 
the expressed gene (or fragment or portion thereof as hereinabove 
indicated) or may further include all or a portion of the non-coding DNA 
(or RNA) of the expressed human gene. 

15 In general, sequences homologous with and contributing to 

the sequences of SEQ ID NOS: 1,3,5,7,9, 11, 13, 15, 17, 19, 21 , 23, 
25 and 27 or 28 (or one at least 60%, preferably at least 80%, and most 
preferably at least 95% identical or homologous thereto) are from the 
coding region of a human gene. 

20 

The present invention also relates to vectors or plasmids 
which include such DNA (or RNA) sequences, as well as the use of the 
DNA (or RNA) sequences. 

25 The sequences depicted in SEQ ID NOS: 1 , 3, 5, 7, 9, 1 1 , 1 3, 1 5, 

17, 19, 21, 23, 25 and 28 are hybridizable with actual DNA and RNA 
sequences as derived from different human tissues. These sequences 
represent cDNA clones. 

3 0 The sequence depicted in Figure 1 (SEQ ID NO: 27) is hybridizable 

with actual DNA and RNA sequences as derived from different human 
tissues. A number of cDNA clones have been generated. The nucleotide 

7 
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sequence of Figure 1 (SEQ ID NO: 27) itself showed a nuclear location in 
the various tissues studied. The distribution of this sequence in various 
human tissues is shown in Figures 3 and 4. Some of these clones had an 
additional 3 '-untranslated region, the presence of which is generally related 
5 to the extent to which the mRNA species remain in the cell before being 
turned over. See Kingman, Genetic Engineering , Blackwell, 1988, at page 
313. The 3 '-untranslated region may also regulate the frequency at which 
the mRNA is translated and thus constitute a mechanism by which the 
expression of the protein can be regulated. (Gray, N.K. & Wickens, M., 
10 Control of Translation Initiation in Animals, Ann. Rev. Cell Dev. Biol. , 
14:399-458 (1998). 

The polynucleotides of the present invention may be in the 
form of RNA or in the form of DNA, which DNA includes cDNA, genomic! 
15 DNA, and synthetic DNA. The DNA may be double-stranded or single- 
stranded, and if single stranded may be the coding strand or non-coding 
(anti-sense) strand. The coding sequence which encodes the mature 
polypeptide may be identical to the coding sequences present as open 
reading frames (ORFs) of the spolynucleotide sequences disclosed herein or 

2 0 may be a different coding sequence, which coding sequence, as a result of 

the redundancy or degeneracy of the genetic code, encodes the same 
mature polypeptide as the polynucleotide sequences of SEQ ID NOS: 1, 3, 
5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25 and 28. 

25 The polynucleotides that code for the polypeptides disclosed 

herein as putative proteins SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 
20, 22, 24, 26 and 29 may include, but are not limited to: only the coding 
sequence for the mature polypeptide; the coding sequence for the mature 
polypeptide and additional coding sequence such as a leader or secretory 

3 0 sequence, a proprotein sequence and a membrane anchor; the coding 

sequence for the mature polypeptide (and optionally additional coding 
sequence) and non-coding sequence, such as introns or non-coding 
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sequence 5' and/or 3' of the coding sequence for the mature polypeptide. 

The polynucleotide which codes for the polypeptide of Figure 2 
(SEQ ID NO:29) may include, but is not limited to: only the coding 
"5 sequence for the mature polypeptide; the coding sequence for the mature 
polypeptide and additional coding sequence such as a leader or secretory 
sequence, a proprotein sequence and a membrane anchor; the coding 
sequence for the mature polypeptide (and optionally additional coding 
sequence) and non-coding sequence, such as introns or non-coding 
10 sequence 5' and/or 3' of the coding sequence for the mature polypeptide. 

The term "polynucleotide" as used for the present invention 
encompasses a polynucleotide which includes only coding sequence for the* 
15 polypeptide as well as a polynucleotide which includes additional coding 
and/or non-coding sequences. 

The present invention further relates to variants of the 
hereinabove described polynucleotides which encode fragments, analogs 

2 0 and derivatives of the polypeptides having the amino acid sequences of 

SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26 and 29. 
Variants of the polynucleotide may be naturally occurring allelic variants of 
the polynucleotides or a non-naturally occurring variant of the 
polynucleotides. 

25 

Thus, the nucleic acids, or polynucleotides, according to the 
present invention may have coding sequences which are naturally occurring 
allelic variants of the coding sequence shown in SEQ ID NOS: 1 , 3, 5, 7, 9, 
11, 13, 15, 17, 19, 21, 23, 25, 27 and 28. As known in the art, an 

3 0 allelic variant is an alternate form of a polynucleotide sequence which may 

have a substitution, deletion or addition of one or more nucleotides, which 
does not substantially alter the function of the encoded polypeptide. 

9 



WO 00/59933 



PCT/US00/08751 



The present invention also includes polynucleotides, wherein 
the coding sequence for the mature polypeptide may be fused in the same 
reading frame to a polynucleotide sequence which aids in expression and 
secretion of a polypeptide from a host ceil, for example, a leader sequence 
5 which functions as a secretory sequence for controlling transport of a 
polypeptide from the cell and a transmembrane anchor which facilitates 
attachment of the polypeptide to a cellular membrane. The polypeptide 
having a leader sequence is a preprotein and may have the leader sequence 
cleaved by the host cell to form the mature polypeptide. The 
10 polynucleotides may also encode for a proprotein which is the mature 
protein plus additional 5' amino acid residues. A mature protein having a 
prosequence is a proprotein and is often an inactive form of the protein. 
Once the prosequence is cleaved an active mature protein remains. 

15 T hus, for example, the polynucleotide of the present invention 

may encode for a mature protein, for a protein having a prosequence, for a 
protein having a transmembrane anchor or for a polypeptide having a 
prosequence, a presequence (leader sequence) and a transmembrane 
anchor. 

20 

The polynucleotides of the present invention may also have 
the coding sequence fused in frame to a marker sequence which allows for 
purification of the polypeptide of the present invention. The marker 
sequence may be a hexa-histidine tag supplied by a pQE-9 vector to provide 
25 for purification of the mature polypeptide fused to the marker in the case of 
a bacterial host, or, for example, the marker sequence may be a 
hemagglutinin (HA) tag when a mammalian host, e.g. COS-7 cells, is used. 
The HA tag corresponds to an epitope derived from the influenza 
hemagglutinin protein (Wilson, I., et al., Cell, 37:767 (1984)). 

30 

Fragments of the full length polynucleotide of the present 
invention may be used as hybridization probes for a cDNA library to isolate 
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the full length cDNA and to isolate other cDNAs which have a high 
sequence similarity to the gene or similar biological activity. Probes of this 
type preferably have at least 15 bases, may have at least 30 bases and 
even 50 or more bases. The probe may also be used to identify a cDNA 
5 clone corresponding to a full length transcript and a genomic clone or clones 
that contain the complete gene including regulatory and promotor regions, 
exons, and introns. An example of a screen comprises isolating the coding 
region of the gene by using the known DNA sequence to synthesize an 
oligonucleotide probe. Labeled oligonucleotides having a sequence 
10 complementary to that of the gene of the present invention are used to 
screen a library of human cDNA, genomic DNA or mRNA to determine 
which members of the library the probe hybridizes to. 



A polynucleotide according to the present invention may have" 
15 at least 1 5 bases, preferably at least 30 bases, and more preferably at least 
50 bases which hybridize to a polynucleotide of SEQ ID NOS: 1,3,5, 7, 9, 
11, 13, 15, 17, 19, 21, 23, 25, 27 and 28 and which has an identity 
thereto, as hereinabove described, and which may or may not retain 
activity. Such polynucleotides may be employed as probes for the 
20 polynucleotides or genes coding for the polypeptides of SEQ ID NOS: 2, 4, 
6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, and 29, for example, for 
recovery of the polynucleotide or as a diagnostic probe or as a PCR primer. 



The polynucleotides according to the present invention may 
25 also occur in the form of mixtures of polynucleotides hybridizable to some 
extent with the gene sequences containing any of the nucleotide sequences 
of SEQ ID NOS: 1,3, 5,7,9, 11, 13, 15, 17, 1 9, 21 , 23, 25, 27 and 28 
including any and all fragments thereof, and which polynucleotide mixtures 
may be composed of any number of such polynucleotides, or fragments 
3 0 thereof, including mixtures having at least 10, perhaps at least 30 such 
sequences, or fragments thereof. 
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Because coding regions comprise only a smaii portion of the 
human genome, identification and mapping of transcribed regions and 
coding regions of chromosomes is of significant interest. There is a 
corresponding need for reagents for identifying and marking coding 
5 regions and transcribed regions of chromosomes. Furthermore, such 
human sequences are valuable for chromosome mapping, human 
identification, identification of tissue type and origin, forensic 
identification, and locating disease-associated genes (i.e., genes that are 
associated with an inherited human disease, whether through mutation, 
10 deletion, or faulty gene expression) on the chromosome. 

Various aspects of the present invention include each of the 
individual sequences, corresponding partial and complete cDNAs, genomic 
DNA, mRNA, antisense strands, PCR primers, coding regions, and 
15 constructs. Expression vectors and polypeptide expression products, are 
also within the scope of the present invention, along with antibodies, 
especially monoclonal antibodies, to such expression products. 

As used herein and except as noted otherwise, all terms are 
20 defined as given below. 

In accordance with the present invention, the term "gene" or 
"cistron" means the segment of DNA {or DNA segment) involved in 
producing a polypeptide chain; it includes regions preceding and following 

25 the coding region (5'-and 3'- untranslated regions, or UTRs, also called 
leader and trailer sequences, regions, or segments) as well as intervening 
sequences (introns) between individual coding segments (exons), which 
intronic regions are typically removed during processing of post- 
transcriptional RNA to form the final translatable mRNA product. Of 

30 course, by their nature, cDNAs contain no intronic sequences. 
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In accordance with the present invention, the term "DNA 
segment" refers to a DNA polymer, in the form of a separate fragment or 
as a component of a larger DNA construct, which has been derived from 
DNA isolated at least once in substantially pure form, i.e., free of 
5 contaminating endogenous materials and in a quantity or concentration 
enabling identification, manipulation, and recovery of the segment and its 
component nucleotide sequences by standard biochemicai methods, for 
example, using a cloning vector. Such segments are provided in the form 
of an open reading frame uninterrupted by internal nontranslated 
10 sequences (introns), which are typically present in eukaryotic genes. 
Sequences of non-translated DNA may be present downstream from the 
open reading frame, where the same do not interfere with manipulation or 
expression of the coding regions. 

15 The nucleic acids and polypeptide expression products 

disclosed according to the present invention, as well as expression 
vectors containing such nucleic acids, may be in "enriched form." As 
used herein, the term "enriched" means that the concentration of the 
material is at least about 2, 5, 10, 100, or 1000 times its natural 

20 concentration (for example), advantageously 0.01%, by weight, 
preferably at least about 0.1% by weight. Enriched preparations of about 
0.5%, 1%, 5%, 10%, and 20% by weight are also contemplated. The 
sequences, constructs, vectors, clones, and other materials comprising 
the present invention can advantageously be in enriched or isolated form. 

25 For example, removal, via the differential display techniques described 
herein, of clones corresponding to ribosomal RNA and "housekeeping" 
genes and clones without human cDNA inserts results in a library that is 
"enriched" in the desired clones. 

3 0 The DNA and RNA sequences, and polypeptides, disclosed in 

accordance with the present invention will commonly be in isolated form. 
The term "isolated" means that the material is removed from its original 
13 
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environment (e.g., the natural environment if it is naturally occurring). For 
example, a naturally-occurring polynucleotide, or DNA, present in a living 
animal is not isolated, but the same polynucleotide or DNA, separated 
from some or all of the coexisting materials in the natural system, is 
5 isolated. Such DNA could be part of a vector and/or such polynucleotide 
could be part of a composition, and still be isolated in that such vector or 
polynucleotide is not part of its natural environment. 

The DNA and RNA sequences, and polypeptides, disclosed 

10 in accordance with the present invention may also be in "purified" form. 
The term "purified" does not require absolute purity; rather, it is intended 
as a relative definition, and can include preparations that are highly 
purified or preparations that are only partially purified, as those terms are 
understood by those of skill in the relevant art. Individual clones isolated 

15 from a cDNA library have been conventionally purified to electrophoretic 
homogeneity. The cDNA clones are obtained via manipulation of a 
partially purified naturally occurring substance (messenger RNA). By 
conversion of mRNA into a cDNA library, pure individual cDNA clones can 
be isolated from the synthetic library by clonal selection. Thus, creating a 

20 cDNA library from RNA and subsequently isolating individual clones from 
that library results in an approximately 10 6 fold purification of the native 
message. Purification of starting material or natural material to at least 
one order of magnitude, preferably two or three orders, and more 
preferably four or five orders of magnitude is expressly contemplated. 

25 Furthermore, claimed polynucleotide which has a purity of preferably 
0.001 %, or at least 0.01 % or 0.1 %; and even desirably 1 % by weight or 
greater is expressly contemplated. 

The term "coding region" refers to that portion of a human 
3 0 gene which either naturally or normally codes for the expression product 
of that gene in its natural genomic environment, i.e., the region coding in 
vivo for the native expression product of the gene. The coding region can 
14 
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be from a normal, mutated or altered gene, or can even be from a DNA 
sequence, or gene, wholly synthesized in the laboratory using methods 
well known to those of skill in the art of DNA synthesis. 

5 in accordance with the present invention, the term 

"nucleotide sequence" refers to a heteropoiymer of deoxyribonucleotides. 
Generally, DNA segments encoding the proteins provided by this invention 
are assembled from cDNA fragments and short oligonucleotide linkers, or 
from a series of oligonucleotides, to provide a synthetic gene which is 
10 capable of being expressed in a recombinant transcriptional unit 
comprising regulatory elements derived from a microbial or viral operon. 

The term "expression product" means that polypeptide or 
protein that is the natural transcription product of the gene and any 
15 nucleic acid sequence coding equivalents resulting from genetic code 
degeneracy and thus coding for the same amino acid(s). 

The term "fragment" when referring to a coding sequence 
means a portion of DNA comprising less than the complete human coding 
20 region whose expression product retains essentially the same biological 
function or activity as the expression product of the complete coding 
region. 

When referring to a portion of a polypeptide, as used herein, 
25 the terms "portion," "segment," and "fragment," refer to a continuous 
sequence of residues, such as amino acid residues, which sequence forms a 
subset of a larger sequence. For example, if a polypeptide were subjected to 
treatment with any of the common endopeptidases, such as trypsin or 
chymotrypsin, the oligopeptides resulting from such treatment would 
30 represent portions, segments or fragments of the starting polypeptide. 
Similarly, portions, segments or fragments of polynucieotides would include 
those products resulting from the treatment of such polynucieotides with 
15 
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endonucieases. 

The term "primer" means a short nucleic acid sequence that 
is paired with one strand of DNA and provides a free 3'OH end at which a 
5 DNA polymerase starts synthesis of a deoxyribonucleotide chain. 

The term "promoter" means a region of DNA involved in 
binding of RNA polymerase to initiate transcription. 

10 The term "open reading frame (ORF)" means a series of 

triplets coding for amino acids without any termination codons and is a 
sequence (potentially) translatable into protein. 

The term "exon" means any segment of an interrupted gens' 
15 that is represented in the mature RNA product. 

As used herein, reference to a DNA sequence includes both 
single stranded and double stranded DNA. Thus, the specific sequence, 
unless the context indicates otherwise, refers to the singie strand DNA of 

2 0 such sequence, the duplex of such sequence with its complement {double 

stranded DNA) and the complement of such sequence. 

In accordance with the present invention, the overall 
approach to identification of cDNAs from hMSCs involved measurement 
25 of gene expression during growth of human mesenchymal stem cells in 
culture. Cells were harvested and the total RNA content thereof was 
recovered. Next, using various primer combinations, reverse transcriptase 
and polymerase chain reaction procedures (RT-PCR) were used to produce 
and amplify the corresponding cDNAs, which were then screened to find 

3 0 regulated DNA sequences that were subsequently purified and cloned. 

These clones were then sequenced and used to determine a consensus 
sequence (one based upon the most commonly occurring bases at each 
16 
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nucleotide positron in a sequence after the contributing sequences are 
aligned by residue position). The resulting sequences were then subjected 
to computer database searches for novelty, and any homology with 
known sequences, using, for example, the BLAST program and the 
5 GenBank database. 

Using the RT-PCR methodology, the mRNA from the cells of 
interest (such as the hMSCs used in accordance with the present 
invention) is used to prepare a set or family of cDNAs corresponding to 

10 the expressed genes of the cell. This cDNA preparation is then 
exhaustively hybridized with mRNA of cells not expressing the gene and 
resulting in removal of all sequences from the cDNA preparation that are 
common to the two cell samples. All of the cDNA sequences that 
hybridize with the other mRNA and those that remain are then hybridized 

15 with mRNA from the cells expressing the gene (for example, cells from a 
healthy person or cells from tissues known to express the gene) to 
confirm that they are in fact the desired coding sequences. Because these 
latter clones contain sequences specific to the mRNA population of the 
cells of interest, they can subsequently be amplified and characterized 

20 using further rounds of PCR and the general techniques of molecular 
biology. 

In accordance with the foregoing, a cDNA library was 
generated and corresponds to the sequences of SEQ ID NOS: 1, 3, 5, 7, 

25 9, 11, 13, 15, 17, 19, 21, 23, 25, 27 and 28. Probes based on these 
cDNAs can be used to identify the relevant transcripts, using Northern 
Blotting Analysis methods well known in the art to localize these 
sequences within cells of various tissues. For example, the heaviest 
distribution of the gene coding for the polypeptide of Figure 2 (SEQ ID 

3 0 NO: 29) was in heart tissue, as shown in Figures 3 and 4. 
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In accordance with the present invention, cDNA was 
quantified by spotting 0.5 |nl aiiquots of standards and samples on 
ethidium agarose pfates prepared as suggested in the instructions from 
the manufacturer (Stratagene, La Jolla, CA). Plates were incubated at 
5 room temperature for 15 minutes and DNA was visualized by UV 
transillumination. The respective cDNAs were then quantified by 
comparing spot intensities of the samples with those of the standards 
(the latter consisting of appropriate dilutions of 1 kb ladders (from Life 
Technology). 

10 

Aiiquots of each amplified library were excised and plasmids 
from randomly chosen colonies were analyzed by restriction nuclease 
analysis. In accordance with the present invention, plasmid DNA was 
digested with both EcoRI and Xhol nucleases (New England Biolabs) and 
15 the resulting restriction fragments were separated on 1.5% agarose gel 
electrophoresis. The cDNA inserts ranged in size from less than 1 kbp to 
larger than 4 kbp (where 1 kbp = 1 ,000 nucleotide base pairs of duplex 
DNA). 

2 0 Each of the DNA sequences identified herein (and the 

corresponding complete gene sequences) can be used in numerous ways 
as polynucleotide reagents. The sequences can be used as diagnostic 
probes for the presence of a specific mRNA in a particular cell type as 
well as in genetic linkage analysis (polymorphisms). Further, the 

25 sequences can be used as probes for locating gene regions associated 
with genetic disease. 

The nucleotide and gene sequences of the present invention 
are also valuable for chromosome identification. Each sequence is 
30 specifically targeted to and can hybridize with a particular location on an 
individual human chromosome. Moreover, there is a current need for 
identifying particular sites on the chromosome. The mapping of the 
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polynucleotides to specific chromosomes according to the present 
invention is an important first step in correlating those sequences with 
genes associated with disease. 

5 Briefly, sequences can be mapped to chromosomes by 

preparing PCR primers (preferably 15-30 bp) from the sequences 
disclosed herein. Computer analysis of these sequences is used to rapidly 
select primers that do not span more than one exon in the corresponding 
genomic DNA, which would otherwise complicate the amplification 
10 process. These primers are then used for PCR screening of somatic cell 
hybrids containing individual human chromosomes. Only those hybrids 
containing the human gene corresponding to the sequences or 
subsequences disclosed herein will yield an amplified fragment. 

PCR mapping of somatic cell hybrids is a rapid procedure for 
assigning a particular sequence to a particular chromosome. Three or 
more clones can be assigned per day using a single thermal cycler, as is 
well known in the art. Using the present invention with the same 
oligonucleotide primers, sublocaiization can be achieved with panels of 
fragments from specific chromosomes or pools of large genomic clones in 
an analogous manner. Other mapping strategies that can similarly be 
used to map a sequence, or part of a sequence, to its chromosome 
include in situ hybridization, prescreening with labeled flow-sorted 
chromosomes and preselection by hybridization to construct chromosome 
specific-cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a cDNA clone to 
a metaphase chromosomal spread can be used to provide a precise 
chromosomal location in one step. This technique can be used with 
30 cDNA as short as 500 or 600 bases; however, clones larger than 2,000 
bp have a higher likelihood of binding to a unique chromosomal location 
with sufficient signal intensity for simple detection. FISH requires use of 
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the clone from which the sequence was derived, and the longer the 
better. For example, 2,000 bp is good, 4,000 is better, but more than 
4,000 is probably not necessary to get good results a reasonable 
percentage of the time. For a review of this technique, see Verma et al., 
5 Human Chromosomes: a Manual of Basic Techniques . Pergamon Press, 
New York (1988). 

Reagents for chromosome mapping can be used individually 
(to mark a single chromosome or a single site on that chromosome) or as 
10 panels of reagents (for marking multiple sites and/or multiple 
chromosomes). Reagents corresponding to noncoding regions of the 
genes actually are preferred for mapping purposes. Coding sequences are 
more likely to be conserved within gene families, thus increasing the 
chance of cross hybridizations during chromosomal mapping. 

15 

Once a sequence has been mapped to a precise 
chromosomal location, the physical position of the sequence on the 
chromosome can be correlated with genetic map data. (Such data are 
found, for example, in V. McKusick, Mendelian Inheritance in Man 
2 0 (available on line through Johns Hopkins University Welch Medical 
Library)). The relationship between genes and diseases that have been 
mapped to the same chromosomal region are then identified through 
linkage analysis (coinheritance of physically close genes). 

25 Next, it is necessary to determine if there are differences in 

the cDNA or genomic sequence between affected and unaffected 
individuals. If a mutation is observed in some or all of the affected 
individuals but not in any norma! individuals, then the mutation is likely to 
be the causative agent of the disease. 

30 



With current resolution of physical mapping and genetic 
mapping techniques, a cDNA precisely localized to a chromosomal region 
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associated with the disease could be one of between 50 and 500 
potential causative genes. (This assumes 1 megabase mapping resolution 
and one gene per 20 kb.) 

5 Comparison of affected and unaffected individuals generally 

involves first looking for structural alterations in the chromosomes, such 
as deletions or translocations that are visible from chromosome spreads or 
detectable using PCR based on that cDNA sequence. Ultimately, 
complete sequencing of genes from several individuals is required to 
10 confirm the presence of a mutation and to distinguish mutations from 
polymorphisms. 

In addition to the foregoing, the sequences of the invention, 
as broadly described, can be used to control gene expression through* 

15 triple helix formation or antisense DNA or RNA, both of which methods 
are based on binding of a polynucleotide sequence to DNA or RNA. 
Polynucleotides suitable for use in these methods are usually 20 to 40 
bases in length and are designed to be complementary to a region of the 
gene involved in transcription (triple helix - see Lee et al, Nucl. Acids Res., 

2 0 6:3073 (1979); Cooney et al, Science, 241 :456 (1988) ; and Dervan et 
ai, Science, 251 : 1360 (1991) ) or to the mRNA itself (antisense - Okano, 
J. Neurochem., 56:560 (1991) ; Oligodeoxynucleotides as Antisense 
Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988)). Triple 
helix- formation optimally results in a shut-off of RNA transcription from 

2 5 DNA, while antisense RNA hybridization blocks translation of an mRNA 

molecule into polypeptide. Both techniques have been demonstrated to 
be effective in mode! systems. Information contained in the sequences of 
the present invention is necessary for the design of an antisense or triple 
helix oligonucleotide. Antisense RNA or oligonucleotide hybridization may 

3 0 also lead to RNAse H activation and hence destruction of the molecules 

involved in the hybrid. 
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The present invention is also a useful tool in gene therapy, 
which requires isolation of the disease-associated gene in question as a 
prerequisite to the insertion of a normal gene into an organism to correct 
a genetic defect. The high specificity of the cDNA probes according to 
5 this invention have promise of targeting such gene locations in a highly 
accurate manner. 

The sequences of the present invention, as broadly defined, 
and including subsequences and fragments thereof, are also useful for 

10 identification of individuals from minute biological samples. The United 
States military, for example, is considering the use of restriction fragment 
length polymorphism (RFLP) for identification of its personnel. In this 
technique, an individual's genomic DNA is digested with one or more 
restriction enzymes, and probed on a Southern blot to yield unique bands* 

15 for identifying personnel. This method does not suffer from the current 
limitations of "Dog Tags" which can be lost, switched, or stolen, making 
positive identification difficult. The sequences of the present invention 
are useful as additional DNA markers for RFLP. 

2 0 However, RFLP is a pattern based technique, which does not 

require the DNA sequence of the individual to be sequenced. Portions of 
the sequences of the present invention can be used to provide an 
alternative technique that determines the actual base-by-base DNA 
sequence of selected portions of an individual's genome. These 
25 sequences can also be used to prepare PCR primers for amplifying and 
isolating such selected DNA. One can, for example, take part of the 
sequence of the invention and prepare two PCR primers from the 5' and 
3' ends of the sequence, or fragment of the sequence. These are used to 
amplify an individual's DNA, corresponding to the sequence. The 

3 0 amplified DNA is sequenced. 
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Panels of corresponding DNA sequences from individuals, 
made this way, can provide unique individual identifications, as each 
individual wiil have a unique set of such DNA sequences, due to allelic 
differences. The sequences of the present invention can be used to 
5 particular advantage to obtain such identification sequences from 
individuals and from tissue. Allelic variation occurs to some degree in the 
coding regions of these sequences, and to a greater degree in the 
noncoding regions. It is estimated that allelic variation between individual 
humans occurs with a frequency of about once per each 500 bases. 

10 Each of the fragments or complete coding sequences comprising a part of 
the present invention can, to some degree, be used as a standard against 
which DNA from an individual can be compared for identification 
purposes. Because greater numbers of polymorphisms occur in the 
noncoding regions, fewer sequences are necessary to differentiate* 

15 individuals. 

If a panel of reagents from the sequences according to the 
present invention is used to generate a unique ID database for an 
individual, those same reagents can later be used to identify tissue from 

2 0 that individual. Positive identification of that individual, living or dead can 

be made from extremely small tissue samples. 

Another use for DNA-based identification techniques is in 
forensic biology. PCR technology can be used to amplify DNA sequences 
25 taken from very small biological samples. In one prior art technique, gene 
sequences are amplified at specific loci known to contain a large number 
of allelic variations, for example the DQa class II HLA gene (Eriich, H., 
PCR Technology, Freeman and Co. (1992)). Once this specific area of 
the genome is amplified, it is digested with one or more restriction 

3 0 enzymes to yield an identifying set of bands on a Southern blot probed 

with DNA corresponding to the DQa class II HLA gene. In accordance 
with the present invention, it is clear from the results depicted in Figure 3 
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and 4 that the novel gene signal according to the present invention is 
found in many different tissues of the body. 

The sequences of the present invention can be used to 
5 provide polynucleotide reagents specifically targeted to additional loci in 
the human genome, and can enhance the reliability of DNA-based 
forensic identifications. Those sequences targeted to noncoding regions 
are particularly appropriate. As mentioned above, actual base sequence 
information can be used for identification as an accurate alternative to 
io patterns formed by restriction enzyme generated fragments. Reagents 
for obtaining such sequence information are within the scope of the 
present invention. Such reagents can comprise complete genes, parts of 
genes or corresponding coding regions, or fragments of at least 1 5 bp, 
preferably at least 18 bp. 

15 

There is also a need for reagents capable of identifying the 
source of a particular tissue. Such need arises, for example, in forensics 
when presented with tissue of unknown origin. Appropriate reagents can 
comprise, for example, DNA probes or primers specific to particular tissue 

2 0 prepared from the sequences of the present invention. Panels of such 

reagents can identify tissue by species and/or by organ type. In a similar 
manner, these reagents can be used to screen tissue cultures for 
contamination. 

25 Sequences that match perfectly to several different genes 

can be detected by hybridizing to chromosomes: if many chromosomal 
loci are observed, the sequence (or a close variant) is in more than one 
gene. This problem can be circumvented by using the 3'-untransiated 
part of the cDNA alone as a probe for the chromosomal location or for the 

3 0 full-length cDNA or gene. The 3'-untranslated region is more likely to be 

unique within gene families, since there is no evolutionary pressure to 
conserve a coding function of this region of the mRNA. 

24 



WO 00/59933 



PCT/USOO/08751 



The cDNA libraries disclosed according to the present 
invention ideally use directional cloning methods so that either the 5' end 
of the cDNA (likely to contain coding sequence) or the 3' end (likely to 
5 be a non-coding sequence) can be selectively obtained. 

Using the sequence information provided herein, the 
polynucleotides of the present invention can be derived from natural 
sources or synthesized using known methods. The sequences falling 

10 within the scope of the present invention are not limited to the specific 
sequences described, but include human allelic and species variations 
thereof. Allelic variations can be routinely determined by comparison of 
one sequence with a sequence from another individual of the same 
species. Furthermore, to accommodate codon variability, the invention* 

15 includes sequences coding for the same amino acid sequences as do the 
specific sequences disclosed herein. In other words, in a coding region, 
substitution of one codon for another which encodes the same amino acid 
is expressly contemplated. (Coding regions can be determined through 
routine sequence analysis.) 

20 

In a cDNA library there are many species of mRNA 
represented. Each cDNA clone can be interesting in its own right, but 
must be isolated from the library before further experimentation can be 
completed. In order to sequence any specific cDNA, it must be removed 

25 and separated (i.e. isolated and purified) from all the other sequences. 
This can be accomplished by many techniques known to those of skill in 
the art. These procedures normally involve identification of a bacterial 
colony containing the cDNA of interest and further amplification of that 
bacteria. Once a cDNA is separated from the mixed clone library, it can 

3 0 be used as a template for further procedures such as nucleotide 
sequencing. 
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The present invention aiso includes recombinant constructs 
comprising one or more of the sequences as broadly described above. 
The constructs comprise a vector, such as a plasmid or viral vector, into 
which a sequence of the invention has been inserted, in a forward or 
5 reverse orientation. in a preferred aspect of this embodiment, the 
construct further comprises regulatory sequences, including for example, 
a promoter, operably linked to the sequence. Large numbers of suitable 
vectors and promoters are known to those of skill in the art, and are 
commercially available. The following vectors are provided by way of 
10 example. Bacterial: pBs, phagescript, PsiX174, pBluescript SK, pBs KS, 
pNH8a, pNH16a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, 
pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic: pWLneo, pSV2cat, 
pOG44, pXT1, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). 

15 

Thus, the present invention is not restricted to such 
constructs or sequences alone but also includes expression vehicles, 
which may include plasmids, viruses, or any other expression vectors, 
including cells and liposomes, containing any of the nucleic acids, 

2 0 nucleotide sequences, DNAs, RNAs, or fragments thereof, as disclosed 

according to the present invention. Furthermore, this will be true 
regardless of whether such sequences are coding sequences or non- 
coding sequences and whether such coding sequences code for all or part 
of the expression products as disclosed herein, so long as such 
25 expression products, or fragments thereof, exhibit some utility in keeping 
with the invention disclosed herein. Thus, while the present invention 
includes an isolated DNA sequence, or nucleic acid, that expresses a 
human protein when in a suitable expression system, for example, a cell- 
free, or in vitro, expression system, such system may also be contained 

3 0 in, or part of, a suitable expression vehicle, or vector, be that a cell, a 

plasmid, a virus, or other operative expression vector. 
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Such expression systems, especially where part of an 
expression vehicle, will commonly require some promoter region that may 
include a promoter different from that normally associated in vivo with 
the genes coding for the gene expression products and proteins disclosed 
5 according to the present invention. Promoter regions can be selected from 
any desired gene using CAT (chloramphenicol transferase) vectors or 
other vectors with selectable markers. Two appropriate vectors are 
pKK232-8 and pCM7. Particular named bacterial promoters include lacl, 
lacZ, T3, T7, gpt, lambda P R , and trc. Eukaryotic promoters include CMV 
10 immediate early, HSV thymidine kinase, early and late SV40, LTRs from 
retrovirus, and mouse metallothionein-l. Selection of the appropriate 
vector and promoter is weli within the level of ordinary skill in the art. 

In a further embodiment, the present invention relates to* 
15 host cells containing the above-described construct(s). The host cell can 
be a higher eukaryotic cell, such as a mammalian ceil, or a lower 
eukaryotic cell, such as a yeast cell, or the host ceil can be a procaryotic 
cell, such as a bacterial cell. Introduction of the construct into the host 
cell can be effected by calcium phosphate transfection, DEAE, dextran 
2 0 mediated transfection, or eiectroporation (Davis, L., Dibner, M., Battey, I., 
Basic Methods in Molecular Biology, 1 986)) . 

The constructs in host cells can be used in a conventional 
manner to produce the gene product coded by the recombinant sequence. 

2 5 Alternatively, the encoded polypeptide, once the sequence is known from 

the cDNAs, or from isolation of the pure product, can be synthetically 
produced by conventional methods of peptide synthesis, either manual or 
automated. 

3 0 Thus, in accordance with the present invention, once the 

coding sequence is known, or the gene is cloned which encodes the 
polypeptide, conventional techniques in molecular biology can be used to 
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obtain the polypeptide. More generally, the present invention includes alt 
polypeptides coded for by any and each of the DNA or RNA sequences 
disclosed herein, including fragments of said polypeptides, as well as 
derivatives and functional analogs thereof. 

5 

At the simplest level, the amino acid sequence can be 
synthesized using commercially available peptide synthesizers. This is 
particularly useful in producing small peptides and fragments of larger 
polypeptides. (Fragments are useful, for example, in generating antibodies 
10 against the native polypeptide.) 

Alternatively, the DNA encoding the desired polypeptide can 
be inserted into a host organism and expressed. The organism can be a 
bacterium, yeast, eel! line, or multicellular plant or animal. The literature* 

15 is replete with examples of suitable host organisms and expression 
techniques. For example, polynucleotide {DNA or mRNA) can be injected 
directly into muscle tissue of mammals, where it is expressed. This 
methodology can be used to deliver the polypeptide to the animal, or to 
generate an immune response against a foreign polypeptide. Wolff, et a!., 

20 Science , 247 :1465 (1990); Feigner, et a!.. Nature , 349 :351 (1991). 
Alternatively, the coding sequence, together with appropriate regulatory 
regions (i.e., a construct), can be inserted into a vector, which is then 
used to transfect a cell. The cell (which may or may not be part of a 
larger organism) then expresses the polypeptide. 

25 

The present invention further relates to polypeptides having an 
amino acid sequence selected from SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 
16, 18, 20, 22, 24, 26, and 29, as well as fragments, analogs and 
derivatives of such polypeptide. 

30 

The terms "fragment," "derivative" and "analog," when 
referring to the polypeptides disclosed herein also mean polypeptides that 
28 
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retain essentially the same biological function or activity as said 
polypeptides. Thus, an analog includes a proprotein which can be activated 
by cleavage of the proprotein portion to produce an active mature 
polypeptide. Such fragments, derivatives and analogs must have sufficient 
5 similarity to the polypeptides disclosed herein so that activity of the native 
polypeptide is retained. 

The polypeptides of the present invention may be recombinant 
polypeptides, natural polypeptides or synthetic polypeptides, preferably 
10 recombinant polypeptides. 

"Recombinant," as used herein, means that a protein is 
derived from recombinant (e.g., microbial or mammalian) expression 
systems. "Microbial" refers to recombinant proteins made in bacterial or 

15 fungal (e.g., yeast) expression systems. As a product, "recombinant 
microbial" defines a protein essentially free of native endogenous 
substances and unaccompanied by associated native glycosylation. 
Protein expressed in most bacterial cultures, e.g., E. coli , will be free of 
glycosylation modifications; protein expressed in yeast will have a 

20 glycosylation pattern different from that expressed in mammalian cells. 

The fragment, derivative or analog of a polypeptide of SEQ ID 
NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26 and 29 may be (i) 
one in which one or more of the amino acid residues are substituted with a 

25 conserved or non-conserved amino acid residue (preferably a conserved 
amino acid residue) and such substituted amino acid residue may or may 
not be one encoded by the genetic code, or (it) one in which one or more of 
the amino acid residues includes a substituent group, or (iii) one in which 
the mature polypeptide is fused with another compound, such as a 

3 0 compound to increase the half-life of the polypeptide (for example, 
polyethylene glycol), or (iv) one in which the additional amino acids are 
fused to the mature polypeptide, such as a leader or secretory sequence or 
29 
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a sequence which is employed for purification of the mature polypeptide or 
a proprotein sequence. Such fragments, derivatives and analogs are deemed 
to be within the abilities of those skilled in the art in view of the teachings 
herein. 

5 

The polypeptides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 
When applied to polypeptides, the term "isolated" has its already stated 
meaning. 

10 

The polypeptides of the present invention include the 
polypeptides of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, and 29, in particular the mature polypeptide, as well as polypeptides 
which have at least 70% identity to these polypeptides, or which have, at 
15 least 90% identity to these polypeptides, still more preferably at least 95% 
identity to these polypeptides and also include portions of such polypeptides 
with such portion generally containing at least 30 amino acids and more 
preferably at least 50 amino acids. 

20 Fragments or portions of the polypeptides of the present 

invention may be employed for producing the corresponding full-length 
polypeptide by peptide synthesis; therefore, the fragments may be 
employed as intermediates for producing the full-length polypeptides. 
Fragments or portions of the polynucleotides of the present invention may 

25 be used to synthesize full-length polynucleotides of the present invention. 

The present invention also relates to vectors which include 
polynucleotides of the present invention, host cells which are genetically 
engineered with vectors of the invention and the production of polypeptides 
30 of the invention by recombinant techniques. 

Host cells are genetically engineered (transduced or 
30 
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transformed or transfected) with the vectors of this invention which may 
be, for example, a cloning vector or an expression vector, either of which 
may be in the form of a plasmid, a viral particle, a phage, etc. The 
engineered host ceils can be cultured in conventional nutrient media 
5 modified as appropriate for activating promoters, selecting transformants or 
amplifying the genes of the present invention. The culture conditions, such 
as temperature, pH and the like, are those previously used with the host cell 
selected for expression, and will be apparent to the ordinarily skilled artisan. 

10 The polynucleotides of the present invention may be employed 

for producing polypeptides by recombinant techniques. Thus, for example, 
the polynucleotide may be included in any one of a variety of expression 
vectors for expressing a polypeptide. Such vectors include chromosomal, 
nonchromosomal and synthetic DNA sequences, e.g., derivatives of SV40; 

15 bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors derived 
from combinations of plasmids and phage DNA, viral DNA such as vaccinia, 
adenovirus, fowl pox virus, and pseudorabies. However, any other vector 
may be used as long as it is repiicable and viable in the host. 

2 0 In accordance with the present invention, an appropriate DNA 

sequence or segment may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into the appropriate 
restriction endonuclease site(s) by procedures known in the art. Such 
procedures and others are deemed to be within the scope of those skilled in 

2 5 the art. 

The DNA sequence in the expression vector is operatively 
linked to an appropriate expression control sequence(s) (for example, a 
promoter sequence) to direct mRNA synthesis. As representative examples 
30 of such promoters, there may be mentioned: LTR or SV40 promoter, the E. 
coli. lac or trp, the phage lambda P L promoter and other promoters known 
to control expression of genes in prokaryotic or eukaryotic cells or their 
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viruses. The expression vector also contains a ribosome binding site for 
translation initiation and a transcription terminator. The vector may also 
include appropriate sequences for amplifying expression. 

5 In addition, the expression vectors preferably contain one or 

more selectable marker genes to provide a phenotypic trait for selection of 
transformed host cells such as dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture, or such as tetracycline or ampicillin 
resistance in E. coli . 

10 

The vector containing the appropriate DNA sequence as 
hereinabove described, as well as an appropriate promoter or control 
sequence, may be employed to transform an appropriate host to permit the 
host to express the protein. 

15 

As representative examples of appropriate hosts, there may be 
mentioned: bacterial cells, such as E. coli , Streptomyces , Salmonella 
typhimurium ; funga! cells, such as -yeast; insect cells such as Drosophila S2 
and Spodoptera Sf9 ; animal cells such as CHO, COS or Bowes melanoma; 

2 0 adenoviruses; plant cells, etc. The selection of an appropriate host is 

deemed to be within the scope of those skilled in the art from the teachings 
herein. 

"Recombinant expression vehicle or vector" refers to a 
25 plasmid or phage or virus or vector, for expressing a polypeptide from a 
DNA (RNA) sequence. The expression vehicle can comprise a 
transcriptional unit comprising an assembly of (1) a genetic element or 
elements having a regulatory role in gene expression, for example, 
promoters or enhancers, (2) a structural or coding sequence which is 

3 0 transcribed into mRNA and translated into protein, and (3) appropriate 

transcription initiation and termination sequences. Structural units 
intended for use in yeast or eukaryotic expression systems preferably 
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include a leader sequence enabling extracellular secretion of translated 
protein by a host cell. Alternatively, where recombinant protein is 
expressed without a leader or transport sequence, it may include an N- 
terminai methionine residue. This residue may or may not be 
5 subsequently cleaved from the expressed recombinant protein to provide 
a final product. 

"Recombinant expression system" means host celts which 
have stably integrated a recombinant transcriptional unit into 
10 chromosomal DNA or carry the recombinant transcriptional unit extra 
chromosomally. The cells can be prokaryotic or eukaryottc. Recombinant 
expression systems as defined herein will express heterologous protein 
upon induction of the regulatory elements linked to the DNA segment or 
synthetic gene to be expressed. 

15 

Mature proteins can be expressed in mammalian cells, yeast, 
bacteria, or other cells under the control of appropriate promoters. Cell- 
free translation systems can also be employed to produce such proteins 
using RNAs derived from the DNA constructs of the present invention. 

2 0 Appropriate cloning and expression vectors for use with prokaryatic and 

eukaryotic hosts are described by Sambrook, et al., Molecular Cloning: A 
Laboratory Manual, Second Edition, (Cold Spring Harbor, N.Y., 1989), 
Wu et ai, Methods in Gene Biotechnology (CRC Press, New York, NY, 
1997), and Recombinant Gene Expression Protocols, in Methods in 
25 Molecular Biology, Vol. 62, (Tuan, ed., Humana Press, Totowa, NJ, 
1997), the disclosures of which are hereby incorporated by reference. 

Transcription of the DNA encoding the polypeptides 
according to the present invention by higher eukarotes can be increased 

3 0 by insertion of an enhancer sequence into the vector. Such enhancers 

have been known for some time and are usually cis-acting elements of 
DNA, usually anywhere from 10 to 300 bp that act on a promoter to 
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increase transcription. Common examples include the SV40 enhancer, the 
cytomegalovirus early promoter enhancer, the polyoma enhancer and the 
enhancers found in adenovirus. 

5 Generally, recombinant expression vectors will include origins 

of replication and selectable markers permitting transformation of the host 
cell, e.g., the ampiciiiin resistance gene of E. coli and S. cerevisiae TRP1 
gene, and a promoter derived from a highly-expressed gene to direct 
transcription of a downstream structural sequence. Such promoters can 

10 be derived from operons encoding glycolytic enzymes such as 3- 
phosphoglycerate kinase (PGK), a-factor, acid phosphatase, or heat shock 
proteins, among others. The heterologous structural sequence is 
assembled in appropriate phase with translation initiation and termination 
sequences, and preferably, a leader sequence capable of directing" 

15 secretion of translated protein into the peripiasmic space or extracellular 
medium. Optionally, the heterologous sequence can encode a fusion 
protein including an N-terminal identification peptide imparting desired 
characteristics, e.g., stabilization or simplified purification of expressed 
recombinant product. 

20 

Useful expression vectors for bacterial use are constructed 
by inserting a structural DNA sequence encoding a desired protein 
together with suitable translation initiation and termination signals in 
operable reading phase with a functional promoter. The vector will 

25 comprise one or more phenotypic selectable markers and an origin of 
replication to ensure maintenance of the vector and to, if desirable, 
provide amplification within the host. Suitable prokaryotic hosts for 
transformation include E. coli, Bacillus subtttis. Salmonella typhimurium 
and various species within the genera Pseudomonas, Streptomyces, and 

30 Staphylococcus, although others may also be employed as a matter of 
choice. 
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As a representative but noniimiting example, useful 
expression vectors for bacterial use can comprise a selectable marker and 
bacterial origin of replication derived from commercially available plasmids 
comprising genetic elements of the well known cloning vector pBR322 
5 (ATCC 37017). Such commercial vectors include, for example, pKK223- 
3 {Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (Promega 
Biotec, Madison, Wl, USA). These pBR322 "backbone" sections are 
combined with an appropriate promoter and the structural sequence to be 
expressed. 

10 

Following transformation of a suitable host strain and growth 
of the host strain to an appropriate cell density, the selected promoter is 
derepressed by appropriate means (e.g., temperature shift or chemical 
induction) and cells are cultured for an additional period. Cells are' 
15 typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. 

Various mammalian cell culture systems can also be 
employed to express recombinant protein. Examples of mammalian 

2 0 expression systems include the COS-7 lines of monkey kidney fibroblasts, 

described by Gluzman, Cell , _23:175 (1981), and other cell lines capable 
of expressing a compatible vector, for example, the C127, 3T3, CHO, 
HeLa and BHK cell lines. Mammalian expression vectors will comprise an 
origin of replication, a suitable promoter and enhancer, and also any 
25 necessary ribosome binding sites, potyadenyiation site, splice donor and 
acceptor sites, transcriptional termination sequences, and 5' flanking 
nontranscribed sequences. DNA sequences derived from the SV40 viral 
genome, for example, SV40 origin, early promoter, enhancer, splice, and 
polyadenylation sites may be used to provide the required nontranscribed 

3 0 genetic elements. 
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Recombinant protein produced in bacterial culture is 
conveniently isolated by initial extraction from cell pellets, followed by 
one or more salting-out, aqueous ion exchange or size exclusion 
chromatography steps. Protein refolding steps can be used, as necessary, 
5 in completing configuration of the mature protein. Finally, high 
performance liquid chromatography (HPLC) can be employed for final 
purification steps. Microbial cells employed in expression of proteins can 
be disrupted by any convenient method, including freeze-thaw cycling, 
sonication, mechanical disruption, or use of cell lysing agents. 

10 

The protein, its fragments or other derivatives, or analogs 
thereof, or cells expressing them, can be used as an immunogen to 
produce antibodies thereto. These antibodies can be, for example, 
polyclonal, monoclonal, chimeric, single chain, Fab fragments, or the* 
15 product of an Fab expression library. Various procedures known in the art 
may be used for the production of polyclonal antibodies. 

Antibodies generated against the polypeptide corresponding 
to a sequence of the present invention can be obtained by direct injection 

2 0 of the polypeptide into an animal or by administering the polypeptide to 

an animal, preferably a nonhuman. The antibody so obtained will then 
bind the polypeptide itself. In this manner, even a sequence encoding 
only a fragment of the polypeptide can be used to generate antibodies 
binding the whole native polypeptide. Such antibodies can then be used 
25 to isolate the polypeptide from tissue expressing that polypeptide. 
Moreover, a panel of such antibodies, specific to a large number of 
polypeptides, can be used to identify and differentiate such tissue. 

For preparation of monoclonal antibodies, any technique 

3 0 which provides antibodies produced by continuous cell line cultures can 

be used. Examples include the hybridoma technique (Kohler and Milstein, 
1975, Nature, 256 :495-497), the trioma technique, the human B-cell 
36 
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hybridoma technique (Kozbor et al., 1983, Immunology Today 4:72), and 
the EBV-hybridoma technique to produce human monoclonal antibodies 
( Cole , et al., 1985, in Monoclonal Antibodies and Cancer Therapy , Alan 
R. Liss, Inc., pp. 77-96). 

5 

Techniques described for the production of single chain 
antibodies (U.S. Patent 4,946,778) can be adapted to produce single 
chain antibodies to immunogenic polypeptide products of this invention. 

10 The antibodies can be used in methods relating to the 

localization and activity of the protein sequences of the invention, e.g., 
for imaging these proteins, measuring levels thereof in appropriate 
physiological samples and the like. 

15 In carrying out the procedures of the present invention it is of 

course to be understood that reference to particular buffers, media, 
reagents, cells, culture conditions and the like are not intended to be 
limiting, but are to be read so as- to include all related materials that one 
of ordinary skill in the art would recognize as being of interest or value in 

2 0 the particular context in which that discussion is presented. For example, 

it is often possible to substitute one buffer system or culture medium for 
another and still achieve similar, if not identical, results. Those of skill in 
the art will have sufficient knowledge of such systems and methodologies 
so as to be able, without undue experimentation, to make such 
25 substitutions as will optimally serve their purposes in using the methods 
and procedures disclosed herein. 

Specific embodiments of the invention will now be further 
described in more detail in the following non-limiting examples and it will 

3 0 be appreciated that additional and different embodiments of the teachings 

of the present invention will doubtless suggest themselves to those of 
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skill in the art and such other embodiments are considered to have been 
inferred from the disclosure herein. 

5 EXAMPLE 

The proteins encoded by the nucleotide sequences of SEQ ID NOS: 
1, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27 and 28 are expressed 
in U20S cells. This is achieved by selectively PCR amplifying the coding 

10 regions thereof (based on the available open reading frames) and then 
cloning the resulting amplicon into a suitable mammalian expression 
vector. One such vector is pcDNA3.1 (sold by fnvitrogen - #K4800-01>. 
The expression of the protein encoded by the described polynucleotide 
sequence is detected in either of two ways: by use of specific antibodies* 

15 raised against peptides derived from the amino acid sequence or by use of 
antibodies against tags added during the cloning procedure. Examples of 
such tags are the V5 epitope or a poly-histidine sequence as contained in 
the pcDNA3.1 vector. In order to accomplish this, cells will normally be 
transfected with the expression construct and cultured for 1 to 5 days. 

2 0 Cells will then be lysed and their protein content analyzed by western 

blotting using the above antibodies as appropriate. Cells will also be 
analyzed for the subcellular localization of the protein encoded by the 
described polynucleotide sequence by transfecting cells in suitable 
chambers, culturing them for 1 to 5 days and fixing them in situ. Such 
25 cells will then be analyzed for the presence and localization of the 
encoded protein by staining ceils with the above-referenced antibodies. 
Alternatively, cells will be transfected with an expression system in which 
the protein encoded by the described polynucleotide sequence is fused to 
a directly detectable tag such as green fluorescent protein (GFP). The 

3 0 expression and localization of the protein encoded by the described 

polynucleotide sequence is then detected by analyzing that of GFP. 
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For purposes of identification of the polypeptides disclosed herein, 
each such polypeptide is listed in the table below along with its calculated 
molecular weight (Daltons) and its expected isoelectric point (pi). 

5 Table 1 . 

SEQ ID NO: # Residues Mol. Wt. pi 



2 


410 


45786.9 


8.96 


4 


227 


26152.3 


8.48 


6 


275 


30781.6 


10.00 


10 


84 


8913.2 


9.35 


12 


281 


30386.7 


9.35 


14 


322 


32977.3 


9.27 


16 


141 


1 6444.4 


9.34 


18 


219 


24418.4 


9.07 


22 


56 


6356.3 


7.85 


24 


344 


37375.6 


5.82 


26 


208 


23864.9 


9.71 


29 


531 


60,576.6 


9.63 



The polypeptides of SEQ ID NOS: 8 and 20 corresponded only to 
partial sequences and thus no values could be calculated and such 
sequences are not in the table. 

25 

All of the polynucleotides from which these polypeptide sequences 
are derived are cDNAs isolated during a differential screen of osteogenic 
mesenchymal stem cells (MSCs) cultured for 4 days in the presence of 
osteogenic supplements. 

30 
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SEQUENCE LISTING 

<110> Van Den Boss, Christian 
Mbalaviele, Gabriel 

<12 0> Human Mesenchymal DNAs and Expression Products 

<130> 640100-363 

<140> 
<141> 

<150> 60/127,418 
<151> 1999-04-01 

<150> 60/148,800 
<151> 1999-08-13 

<160> 29 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1974 
<:212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> Description of Artificial Sequence : cDNA derived 

from human mesenchymal stem cells after treatment 
with growth supplement 

<400> 1 

ggcacgagca gaacaagttt ggcctgtcac tgctcctcat cctcctgagc cgtggtgaag 60 
acctacagag ttcagaccct gctacagaat caacacaaaa taatcagtgg acggaggtga 12 0 
tgttcatggc aacacgagaa cttctgcgga ttccccaagc agccctggcc aagccaatct 18 0 
ctatacctac aaacctagtg tccctctttt ctcgctatgt tgaccggcag aaactgaact 24 0 
tgctggagac aaaactgcag ctagttcagg ggatacgata aaagatctcc aaatgtgtcc 300 
tgtacctcct tttggctgcc acctgcactg ctgccatcac caatggagtg tttttaatga 360 
gggaaggaag gtagcttttt ccccaaagca aagtcttgtg ggatcgattc ctgtttacag 420 
gggttgtctc tctaaatgtc agatatttcc ccactgctct atgaaatttg gctgggtgat 48 0 
acttctgctg gtttctttac cttctgtgtt acagttctgc atgtcctact tttactcagt 54 0 
tctgttttgc attttcfcttg ccctagagac acaagtgtaa tctctccctt tatccctcca 60 0 
ctactccacc tcagagtaga ttgtagcctg ccaaaggatt ccttccctca tcctattgaa 660 
gttgtttttt cattgcccca tattaatatg actatagaag agccaattaa gtagaatcaa 720 
gatatacaca cacacataga tacacacaca cacaccccat acatgtattt atgtggtctt 780 
cagagggtcc ttaaagaatg aattttagat tgaaaaatat ttagttgtct cattacctct 840 
tctaaacaca aaccagctga tgtattttaa tctgtttctg ttctatcttg taattaattt 900 
ggtgggttct acttgtttta acataaataa agagtatgca gcacgtttaa taaaatcaga 960 
actcttaatt ggcttatgcc caggtctagg ctgagaagtc ctttttcttc ttcccacctt 102 0 
tatttcctta gtttctgtcc accttaatcg aaacaacaca tggttatgtc tttttcctgc 1080 
tacaactaca gggtacttga gcctttcccc tcaagtgcat tcgaagtcac ccaggatgat 114 0 
cctcactagt agcctgcttt ggcagtgtgg ctttttgcac acttgccctg tcttcctgag 1200 
actacttcag taagccatgc ttccttcttc cccactttta tttggtgtca tgaatagaaa 126 0 
cttccaaatg taaccatgga agctaagttt ggcctgcttt gctttttagt ctccacacca 1320 
tgggcagaac tgctgtcttt actacttcat ctcacccaag tcccgttccc aggcagccag 1380 
gggcctgggt ttgaataatt gcagggccag cctgccatga tctttctcac ttactcctct 1440 
cccattcagc aatcaaccag actaaggagt tttgatccct agtgattaca gccctgaaga 1500 
aaattaaatc tgaattaatt ttacatggcc ttcgtgatct ttctgctgtt cttacttttt 1560 
cgaatgtagt tggggggtgg gagggacagg ttatggtatt taaagagaat aaacattttg 1620 
cacatacatg tattgtacaa cagtaagatc ctctgttaaa accagctgtc ctgttctcca 168 0 
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tctccatttc ttcccatgct gtaaccccag gctccaccag ctgttcccca gtgatgttac 1740 

ctagcttccc tctaccgttg tctactgacc atttccacta catgcctttc ctaccttccc 1800 

ttcacaacca atcaagtgaa tacttgatta ttatctcttc cttactgtgc tttatctttt 1860 

ttgtttggat tggttctaat taatgaaaat aaaagtttct aaatttacat ttttataggg 192 0 

tattgtaaat aaaaacaaat tgtatactta aaaaaaaaaa aaaaaaaaaa aaaa 1974 

<210> 2 

<211> 410 
<212s> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Putative 
protein derived from ORF cDNA of SEQ ID N0:1 

<400> 2 

Met Glu Leu Pro Ser Gly Pro Gly Pro Glu Arg Leu Phe Asp Ser His 
15 10 15 

Arg Leu Pro Gly Asp Cys Phe Leu Leu Leu Val Leu Leu Leu Tyr Ala 
20 25 30 

Pro Val Gly Phe Cys Leu Leu Val Leu Arg Leu Phe Leu Gly lie His 
35 40 45 

Val Phe Leu Val Ser Cys Ala Leu Pro Asp Ser Val Leu Arg Arg Phe 
50 55 60 

Val Val Arg Thr Met Cys Ala Val Leu Gly Leu Val Ala Arg Gin Glu 
65 70 75 80 

Asp Ser Gly Leu Arg Asp His Ser Val Arg Val Leu He Ser Asn His 
85 90 95 

Val Thr Pro Phe Asp His Asn lie Val Asn Leu Leu Thr Thr Cys Ser 
100 105 no 

Thr Pro Leu Leu Asn Ser Pro Pro Ser Phe Val Cys Trp Ser Arg Gly 
115 120 125 

Phe Met Glu Met Asn Gly Arg Gly Glu Leu Val Glu Ser Leu Lys Arg 
130 135 140 

Phe Cys Ala Ser Thr Arg Leu Pro Pro Thr Pro Leu Leu Leu Phe Pro 
3-45 150 155 160 

Glu Glu Glu Ala Thr Asn Gly Arg Glu Gly Leu Leu Arg Phe Ser Ser 
165 170 175 

Trp Pro Phe Ser He Gin Asp Val Val Gin Pro Leu Thr Leu Gin Val 
180 185 190 

Gin Arg Pro Leu Val Ser Val Thr Val Ser Asp Ala Ser Trp Val Ser 
195 200 205 

Glu Leu Leu Trp Ser Leu Phe Val Pro Phe Thr Val Tyr Gin Val Arg 
210 215 220 

Trp Leu Arg Pro Val His Arg Gin Leu Gly Glu Ala Asn Glu Glu Phe 
225 230 235 240 
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Ala Leu Arg Val Gin Gin Leu Val Ala Lys Glu Leu Gly Gin Thr Gly 

245 250 255 

Thr Arg Leu Thr Pro Ala Asp Lys Ala Glu His Met Lys Arg Gin Arg 

2S0 265 270 

His Pro Arg Leu Arg Pro Gin Ser Ala Gin Ser Ser Phe Pro Pro Ser 

275 280 285 

Pro Gly Pro Ser Pro Asp Val Gin Leu Ala Thr Leu Ala Gin Arg Val 
290 295 300 

Lys Glu Val Leu Pro His Val Pro Leu Gly Val lie Gin Arg Asp Leu 

305 310 315 320 

Ala Lys Thr Gly Cys Val Asp Leu Thr lie Thr Asn Leu Leu Glu Gly 

325 330 335 



Ala Val Ala Phe Met Pro Glu Asp 
340 

Pro Thr Ala Ser Ala Ser Lys Phe 
355 360 



lie Thr Lys Gly Thr Gin Ser Leu 
345 350 

Pro Ser Ser Gly Pro Val Thr Pro 
365 



Gin Pro Thr Ala Leu Thr Phe Ala Lys Ser Ser Trp Ala Arg Gin Glu 

370 375 380 

Ser Leu Gin Glu Arg Lys Gin Ala Leu Tyr Glu Tyr Ala Arg Arg Arg 
385 390 395 400 

Phe Thr Glu Arg Arg Ala Gin Glu Ala Asp 
405 410 



<210> 3 
<211> 1619 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : cDNA derived 

from human mesenchymal stem cells after treatment 
with growth supplement 

<400> 3 

ggcacgaggc gccttctcct tgcttctggg ggtcgtggcc ttgctcccgc tgtgcgggaa 60 
aagaatccag gcccttccac gcgcgtgtgg gtgcgggggc cccgaagtgc tcgtggttcc 12 0 
ccgctaggtc tccgctgggg caggaaccgg aatcatgggt gggaccacca gcacccgccg 18 0 
ggtcaccttc gaggcggacg agaatgagaa catcaccgtg gtgaagggca tccggctttc 24 0 
ggaaaatgtg attgatcgaa tgaaggaatc ctctccatct ggttcgaagt ctcagcggta 3 00 
ttctggtgct tatggtgcct cagtttctga tgaagaattg aaaagaagag tagctgagga 36 0 
gctggcattg gagcaagcca agaaagaatc cgaagatcag aaacgactaa agcaagccaa 42 0 
agagctggac cgagagaggg ctgctgccaa tgagcagtta accagagcca tccttcggga 480 
gaggatatgt agcgaggagg aacgcgctaa ggcaaagcac ctggctaggc agctggaaga 54 0 
gaaagaccga gtgctaaaga agcaggatgc attctacaaa gaacagctgg ctagactgga 60 0 
ggagaggagc tcagagttct acagagtcac cactgaacaa tatcagaaag ctgctgaaga 660 
gtggaagcaa agttcaagcg atatgagtct catccagtct gtgctgatct gcaggccaaa 720 
attcttcagt gttaccgtga gaacacccac cagaccctca aatgctccgc tctggccacc 78 0 
cagtatatgc actgtgtcaa tcatgccaaa cagagcatgc ttgagaaggg aggataaaaa 840 
ctttcagaat gagcaaaaca ccatcaacgt taattccaga gatggaacat tttttttcct 900 
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agtgagaaaa caacccattt gaagagaaga ccactaatga gaagaccact aaagagagac 96 0 
atcaagaatg gattcagcag aatcatttca cgttttgaac agcagcagtt tgaagggcca 1020 
aagccttgat cagggatcag tcattaaagg acactcttga gtattagtaa accctcttat 1080 
gatgattaaa agagaagggc agccctctcc accttttggt actttctatt caacttgcac 1140 
tgaccataaa atgtttctct tctgaacaag ccccatcatt tggtgaacct ccaccctaac 1200 
Eiaagtaggat ggggttgggg gctaaattaa ttggagtggg gcgaggagag agccagaaaa 12 60 
catagatccg agggcagcag tgctgggtgg agagagccag aaaacagatc tggaggcagc 1320 
agtgctggat ggaattgtct aggctgtggc atgttggttt tgtctttctt ttctcctttg 13 80 
attatgtaag agctatttca ttataactta ttatggtgat tatacaggca agaagacaaa 1440 
aaggagagaa aatgtacctc ttctactgga ataatgttta tgattacaag tgagataagg 15 00 
tatttttatc aatatgaagg caaccttggc tgataaaacc tctatagtga atactcacat 1560 
ctttacttca ctcactatca ataataaata fcattttctga caaaaaaaaa aaaaaaaaa 1619 



<210> 4 
<211> 227 
<212> PRT 

<213> Artificial Sequence 
<22 0> 

<223> Description of Artificial Sequence : Putative 

protein derived from ORF of cDNA of SEQ ID NO: 3. 

<400> 4 

Met Gly Gly Thr Thr Ser Thr Arg Arg Val Thr Phe Glu Ala Asp Glu 
15 10 15 

Asn Glu Asn lie Thr Val Val Lys Gly lie Arg Leu Ser Glu Asn Val 
20 25 30 

lie Asp Arg Met Lys Glu Ser Ser Pro Ser Gly Ser Lys Ser Gin Arg 
35 40 45 

Tyr Ser Gly Ala Tyr Gly Ala Ser Val Ser Asp Glu Glu Leu Lys Arg 
50 55 60 

Arg Val Ala Glu Glu Leu Ala Leu Glu Gin Ala Lys Lys Glu Ser Glu 
65 70 75 80 

Asp Gin Lys Arg Leu Lys Gin Ala Lys Glu Leu Asp Arg Glu Arg Ala 
85 90 95 

Ala Ala Asn Glu Gin Leu Thr Arg Ala lie Leu Arg Glu Arg lie Cys 
100 105 110 

Ser Glu Glu Glu Arg Ala Lys Ala Lys His Leu Ala Arg Gin Leu Glu 
115 120 125 

Glu Lys Asp Arg Val Leu Lys Lys Gin Asp Ala Phe Tyr Lys Glu Gin 
130 135 140 

Leu Ala Arg Leu Glu Glu Arg Ser Ser Glu Phe Tyr Arg Val Thr Thr 
145 150 155 160 

Glu Gin Tyr Gin Lys Ala Ala Glu Glu Val Glu Ala Lys Phe Lys Arg 
165 170 175 

Tyr Glu Ser His Pro Val Cys Ala Asp Leu Gin Ala Lys lie Leu Gin 
180 185 190 

Cys Tyr Arg Glu Asn Thr His Gin Thr Leu Lys Cys Ser Ala Leu Ala 
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195 200 205 

Thr Gin Tyr Met His Cys Val Asn His Ala Lys Gin Ser Met Leu Glu 
210 215 220 

Lys Gly Gly 
225 



<210> 5 
<211> 1917 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : cDNA derived 

from human mesenchymal stem cells after treatment 
with growth supplement 

<4 00> 5 

ggcacgagga gagaggtggt gaaactcaag atccccatgg acaacaagga ggtcccgggc 6 0 
gaggcgcccg cgccgtccgc cgacccggcg cgtccccacg cgtgccccga ctgcggccgc 12 0 
gccttcgcgc gccgctccac gctggcgaag cacgcgcgca cgcacacggg cgaacggccc 18 0 
ttcgggtgca ccgagtgcgg gcggcgcttc tcacagaagt cggcgctgac caaacacggc 24 0 
cgcacgcaca cgggcgagcg gccctacgag tgccccgagt gcgacaaacg cttctcggcc 3 00 
gcctcgaacc tgcggcagca ccgacggcgg cacacgggcg agaagccgta cgcatgcgcg 3 60 
cactgcggcc gccgcttcgc gcagagctcc aactacgcac agcacctgcg cgtgcacacg 42 0 
ggcgagaagc cgtacgcgtg cccggactgc ggacgcgcct ttggcggcag ctcgtgcctg 480 
gcgcgccacc gacgcacgca cacgggcgag cggccctacg cttgcgccga ctgcggcacg 54 0 
cgcttcgctc agagctcggc gctggccaag caccggcgcg tgcacacggg cgagaagccg 6 00 
caccgctgcg ctgtgtgtgg ccgtcgcttc ggccaccgct ccaacctggc ggagcacgcg 66 0 
cgcacgcaca caggcgagcg gccctacccc tgcgccgagt gcggccgccg cttccgccta 72 0 
agctcgcact tcattcgcca ccgacgcgcg cacatgcggc gccgcctgta tatttgcgcc 780 
ggctgcggca gggacttcaa gctgccccct ggcgccacgg ccgccactgc caccgagcgt 84 0 
tgcccggagt gtgagggcag ctgagtcccg cagggctgcg gaggggcgcg ctggggcttc 900 
gacctggctg cactaaccca ggctcctcct cgccccggcc tccgggtctg ggaaattgag 960 
gggacggcag gcccggctgc cctggaactg ggagacaggg agaatcccct gccggggtcc 1020 
ctggaaacag tgcccacccc acatcactac attccctcgg cccgtgttag tgaataaagt 1080 
attatatcct caccccaqcc gtgcctgtga gtgaggtggg tgggagagga agaaagttgg 1140 
ggttctccag gctcaggtgc caagtgagtt gtcaaggaac caaatgggga tgtaaaccta 12 00 
aaaggggttc ccggcacctc ggtttgtgtt ggttggaggt gatcgcacac ttggcccttg 1260 
gttacgtcct cataacctta gacctgaaag ggcccataaa tatactatgt tcacgatcag 1320 
acacgcactg cattcggcag agctccagtg agcaaggcac gaccctcaga tctcagtcta 13 80 
gtgaaggaga gaaaactgta ataacactac gttaaaggtt ttaactgctt tgttatgtaa 144 0 
gcttacccag cccggcgcac agtgactcac gcctgtaatc ccagcacttt gggagggcga 1500 
ggctagcaga tcacttgagg ttaggagttc gataccagcc tggccaacat ggtgaaaccc 1560 
ggtctctact aaaaatacaa aaattaactg ggtgtggtgg cgggcgcctg taatcccagc 1620 
tactgagggg gctgaggcat gagaatcact tgaacctggg agacagaggt tgcaatgaac 1680 
cgagatagtg ccattgcact ccggcctggg caacagagga agactgcctc aaacaaacaa 174 0 
aaaacaacaa accaaaccaa accaaaaaaa tctcaaagcg attggaccta gcagctcatg 1800 
cctgtaatct ccagcacttt gggaggcgga ggcaggagga tctcttgaag tcaagagttt 1860 
cjagatcagcc tggagaacaa agtgagaccc ccatctatta aaaaaaaaaa aaaaaaa 1917 



<210> 6 

<211> 275 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Putative 
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protein derived from ORF of cDNA of SEQ ID NO : 5. 
<400> 6 

Met Asp Asn Lys Glu Val Pro Gly Glu Ala Pro Ala Pro Ser Ala Asp 
15 10 15 

Pro Ala Arg Pro His Ala Cys Pro Asp Cys Gly Arg Ala Phe Ala Arg 
20 25 30 

Arg Ser Thr Leu Ala Lys His Ala Arg Thr His Thr Gly Glu Arg Pro 
35 40 45 

Phe Gly Cys Thr Glu Cys Gly Arg Arg Phe Ser Gin Lys Ser Ala Leu 
50 55 60 

Thr Lys His Gly Arg Thr His Thr Gly Glu Arg Pro Tyr Glu Cys Pro 
65 70 75 80 

Glu Cys Asp Lys Arg Phe Ser Ala Ala Ser Asn Leu Arg Gin His Arg 
85 90 95 

Arg Arg His Thr Gly Glu Lys Pro Tyr Ala Cys Ala His Cys Gly Arg 
100 105 110 

Arg Phe Ala Gin Ser Ser Asn Tyr Ala Gin His Leu Arg Val His Thr 
115 120 125 

Gly Glu Lys Pro Tyr Ala Cys Pro Asp Cys Gly Arg Ala Phe Gly Gly 
130 135 140 

Ser Ser Cys Leu Ala Arg His Arg Arg Thr His Thr Gly Glu Arg Pro 
145 150 155 160 

Tyr Ala Cys Ala Asp Cys Gly Thr Arg Phe Ala Gin Ser Ser Ala Leu 
165 170 175 

Ala Lys His Arg Arg Val His Thr Gly Glu Lys Pro His Arg Cys Ala 
180 185 190 

Val Cys Gly Arg Arg Phe Gly His Arg Ser Asn Leu Ala Glu His Ala 
195 200 205 

Arg Thr His Thr Gly Glu Arg Pro Tyr Pro Cys Ala Glu Cys Gly Arg 
210 215 220 

Arg Phe Arg Leu Ser Ser His Phe lie Arg His Arg Arg Ala His Met 
225 230 235 240 

Arg Arg Arg Leu Tyr lie Cys Ala Gly Cys Gly Arg Asp Phe Lys Leu 
245 250 255 

Pro Pro Gly Ala Thr Ala Ala Thr Ala Thr Glu Arg Cys Pro Glu Cys 
260 265 270 

Glu Gly Ser 
275 



<210> 7 
<211> 1150 
<212> DNA 
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<213> Artificial Sequence 
<22 0> 

<223> Description of Artificial Sequence : cDNA derived 

from human mesenchymal stem cells after treatment 
with growth supplement 

<400> 7 

ggcacgagct gcggaaaatc tttccgctca gcacatcacc tcaaagtgca tctgcgagtg 60 
cacacaggcg agcggcccta caagtgtccg cactgcgact acgcgggcac ccagtccggc 120 
tcgctcaagt atcacctaca gcgccaccac cgggagcaga ggagcggggc cggccccggg 180 
ccacccccgg agccaccgcc tccttcccag cggggttcgg ccccgcaatc tggagccaag 240 
ccgtctccgc agcctgcgac ctgggtggag ggcgcctcaa gtccccggcc tccttctagc 3 00 
ggtgctgggc cggggtcccg tcggaagccc gccagccctg ggaggaccct gcgcaacggg 360 
cgaggcggtg aggccgaacc cctggacctg tccttgcggg cagggccggg aggcgaggcc 42 0 
gggcctgggg gtgccctcca ccgctgcctc ttctgcccgt tcgccactgg agccccagag 48 0 
ctcatggcct tgcaccttca agtgcaccac agccgccggg ctaggggccg ccggccaccc 54 0 
caggctgacg cgtccccgcc ctatgcccga gtaccatcag gagagacccc tcccagtcct 60 0 
tcgcaggaag gggaggaggg ctccgggctg tccagacccg gagaggcagg gctggggggg 66 0 
caagaacggt agtgggccct caggggcgat tagcttagtg agcttacccc gccggagcgg 72 0 
gggagcgcta gaggtagttg ggtagagaag ccagcagggg gcagcgtggg acccatatcc 78 0 
cagccccagg cggaccagag gtggaaggcg cggcgggcgt aggagggtgg gctggcggta 84 0 
cccacccagc ccaagggagt cacagtgcct tagaagtggc agaggtaggg tcaaagaacg 900 
gggtccctgg gctggcagag aaggtttgtg tccctgttga ggagccactc tgccagtctt 960 
tgctggtcca taggcgtgag agtttatttt tgtacaaggg gtgagggtgg ggggcactgt 1020 
iicccttctag ccccatcagg ggccctgtag acgtcgctat ttttggtacg attcctgtca 1080 
tccctattgc agagtcgtgt ccccaataaa caggtgtctt gaggcacagg aaaaaaaaaa 114 0 
aaaaaaaaaa 1150 



<210> 8 
<211> 100 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Putative 

protein derived from ORF of cDNA of SEQ ID NO: 7. 

<40 0> 8 

lie Arg His Glu Leu Arg Lys He Phe Pro Leu Ser Thr Ser Pro Gin 
15 10 15 

Ser Ala Ser Ala Ser Ala His Arg Arg Ala Ala Leu Gin Val Ser Ala 
20 25 30 

Leu Arg Leu Arg Gly His Pro Val Arg Leu Ala Gin Val Ser Pro Thr 
35 40 45 

Ala Pro Pro Pro Gly Ala Glu Glu Arg Gly Arg Pro Arg Ala Thr Pro 
50 55 60 

Gly Ala Thr Ala Ser Phe Pro Ala Gly Phe Gly Pro Ala lie Trp Ser 
65 70 75 80 

Gin Ala Val Ser Ala Ala Cys Asp Leu Gly Gly Gly Arg Leu Lys Ser 
85 90 95 

Pro Ala Ser Phe 
100 
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<210> 9 
<:211> 1573 
<:212> DNA 

<;213> Artificial Sequence 
<:220> 

<:223> Description of Artificial Sequence : cDNA derived 

from human mesenchymal stem cells after treatment 
with growth supplement 

<400> 9 

ggcacgaggt gggctgcacg ctcccctcca tccccatcgg ccctgtcccc tggagtgtgt 6 0 
cagagcccag gggagaatgc agcccaccag gagcacctgg accccctgcc cgccacatgg 12 0 
tgtggccatc actcagcccc tacccctgcc ctgctcctaa gggtagaaaa ctccagggtc 18 0 
ccctgccacc gactgcccag ccactccaag ccccctggca gctgcccctc ctggagcaga 240 
aagtgccttt atctcagcca tccgcagact gctcggccag atgcggggac aggctggaat 3 00 
gagggaggcg tcttcatctc cctgccatcc ccctctcacg ccacccccgc ccccaccggg 3 60 
ctgcaggtgc tgctgatgcg ctgggatctg attgaggata aaaaggaagg agagatgacc 420 
cctaccccct catcccccag ttttgaaaag gtctaagcaa gtgagtctgg tggaggagct 480 
gagggaggga gccatggaag gtgccagaag gaaggttggc gggggcacgt gtgggccgfcg 540 
gcttgggctg gtcagagtgg cgtgagctgc ccggcgcctg ccctgcccaa gtgaccaggg 600 
aagtgtgtgt gtgtccatgt gtatgcgtgt ccgtctgtct gtctagtgtc tgggtttggc 660 
ccaagactgg gctgtagtta cattaatgcc cagccagcca cccctgccac tcacccctcc 72 0 
tggcccaggc cttgctgact ctctgagctg gggaggtggg aggccaggcg agcctgactc 780 
tgttgatcta cccgtgcctg ggcccctccc ctcagagccc atggtaacga acccctagaa 840 
aggagagaac gggcgtcagg ggtgcacagt ccacagctga agagcaaggt ttcgtggcag 9 00 
cacggcccgg cccctcaccc tctgtcccca cgaggggacc catgggggct gtctttgcag 9 60 
ggcacagatg accaaagtcc cttcctgctt cctgttacct gtcttgctcc tggggagaaa 1020 
gaggggcctg atgagactcc actcaggtgc acacatcacc aggtgcatct gcaggcactg 1080 
ggctggctgc ttgcagccag gagaaggtca gcgagaagga gtgtatgagt gtgagtgtgt 1140 
gtgcatggaa gttggggcac tgggcgtctg actccctccc cacccaagag aggaaggacc 1200 
cctcaccacc cccactggcg agacagttta ctttgccgac ttgccatgtt tttgccaaaa 1260 
ccaagatttt gaaggaaatg agtggccagc gccagggccc aggccatgtg gcctgcccag 1320 
cctcaatgtc acttggtggc ggggtggggt gggggtgggc agcagcatcc cagccttgag 1380 
atgcttcact ttccttctct gtaaccagac tttgaaaaat tgttcgtttc atcaggctct 1440 
gttcctcaat ggccttttgc tacgtgcctc ccgagaaatt tgtctttttg tataaatgac 1500 
aaagtgttga aaatgtattt cctgaaataa atgtttcaaa tgcagaaacc caaaaaaaaa 1560 
aaaaaaaaaa aaa 1573 



<210> 10 
<211> 84 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Putative 

protein derived from ORF of cDNA of SEQ ID NO: 9. 

<400> 10 

Met Ser Val Ser Val Cys Ala Trp Lys Leu Gly His Trp Ala Ser Asp 
15 10 15 

Ser Leu Pro Thr Gin Glu Arg Lys Asp Pro Ser Pro Pro Pro Leu Ala 
20 25 30 

Arg Gin Phe Thr Leu Pro Thr Cys His Val Phe Ala Lys Thr Lys lie 
35 40 45 

Leu Lys Glu Met Ser Gly Gin Arg Gin Gly Pro Gly His Val Ala Cys 
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Pro Ala Ser Met Ser Leu Gly Gly Gly Val Gly Trp Gly Trp Ala Ala 



Ala Ser Gin 



<210> 11 
<211> 1808 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : cDNA derived 

from human mesenchymal stem cells after treatment 
with, growth supplement 

<400> 11 

ggcacgaggc cgcccggccg agcgcggagc gcagccactc gccgctgccc agggagcgcc 60 

caagatgtgg ggggaccggg gcggcagcgg ccgtagcagc gccagggacg ggggcacgca 120 

gcagcctccg ctcgcccgcc tgtcctgacc tgcctcgctt gcccccaaag aatgtcagcc 180 

aagtccaagg ggaacccctc ctcgtcctgt ccagccgagg gaccgccggc agcctccaaa 240 

accaaggtga aggaacagat caagatcatc gtggaggatt tggaattagt cctgggcgac 3 00 

ctgaaggacg tggccaagga acttaaggag atgaagtccc actctgttgc ccaggctaga 3 60 

gtgcaatggc acaatctggg ctcactgcaa cctctgcctc ccaggttcaa gctattctcc 420 

tgcctcagcc tgcctcagtg cgccactacg cctgggtggt tgaccagatt gacaccctga 480 

cctctgacct acagctggag gatgagatga ctgacagctc caaaacggac acgctgaata 540 

gtagctcaag tggcacaaca gcctccagcc tagagaagat caaagtgcag gctaatgcac 600 

cgcttattaa acccccagca cacccatctg ctatcctcac ggtcctgaga aagccaaacc 660 

ctccaccacc tcctccaagg ttgacacctg tgaagtgtga agaccccaaa agggtggttc 720 

caactgccaa tcctgtaaaa accaatggca cccttctacg aaatggaggc ttaccaggtg 780 

gacctaacaa aattccaaat ggagatatct gctgcatacc caacagtaac ttggacaagg 840 

ctccagtcca gcttctgatg catagacctg aaaaagacag atgtccccag gcagggcctc 900 

gagaacgagt tcggtttaat gaaaaagtac agtaccatgg ctattgtcct gactgtgata 960 

cccggtataa cataaaaaac agggaggtcc acttacacag tgaacctgtc cacccaccgg 102 0 

gaaagattcc tcaccaaggc cctcccctcc ctcctacacc ccatctccct cctttcccac 108 0 

tagaaaatgg gggaatggga ataagccaca gtaacagctt cccccctatc agacctgcaa 1140 

ctgtgcctcc tcccactgca ccaaaaccac agaagacgat cttgaggaag tcaaccacta 1200 

caaccgtgtg atgtatgcca ttaaaaaaat tgttttttta attttctata ttataaacat 1260 

aaaataagta atgagcactt tctactcaag caataaaaag cccaaatata ttaatcctgc 1320 

attcagcaaa gtggcataaa aatcacctgg taagtatgca gcacattgct tatatcctgg 1380 

gtatgcatta ttttaaatgt tgtatcatta aaaacctcag aatgatgaaa aatatgaatg 1440 

atgcattgtt tttgcaattg acctatgaca aactgtgaac ctgcagattt cacctatttt 1500 

gatttactat aagagctggg atttgattca ttttatttat gcctaagtca tctatgcatt 1560 

aacatgtcat attcttaact ttgatctaat gctttttact aggaaatttt aatactgaag 1620 

gactatttta ttattttttt ctaaagatgt ttgtcactag tttttcatta ttaaatgctg 1680 

aggccaatac caagaagttt attttctata ttatacaatt atgaattaca tgctcagcta 1740 

tatatgtaat aaaatacttt ggtctgtgga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1800 

1808 



<210> 12 
<211> 281 
<212> PRT 

<213> Artificial Sequence 
<22 0> 

<223> Description of Artificial Sequence : Putative 

protein derived from ORF of cDNA of SEQ ID NO: 11. 
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<400> 12 

Met Ala Gin Ser Gly Leu Thr Ala Thr Ser Ala Ser Gin Val Gin Ala 
15 io is 

He Leu Leu Pro Gin Pro Ala Ser Val Arg His Tyr Ala Trp Val Val 
20 25 30 

Asp Gin He Asp Thr Leu Thr Ser Asp Leu Gin Leu Glu Asp Glu Met 
35 40 45 

Thr Asp Ser Ser Lys Thr Asp Thr Leu Asn Ser Ser Ser Ser Gly Thr 
50 55 60 

Thr Ala Ser Ser Leu Glu Lys He Lys Val Gin Ala Asn Ala Pro Leu 
S5 70 75 80 

He Lys Pro Pro Ala His Pro Ser Ala He Leu Thr Val Leu Arg Lys 
85 90 95 

Pro Asn Pro Pro Pro Pro Pro Pro Arg Leu Thr Pro Val Lys Cys Glu 
100 105 HO 

Asp Pro Lys Arg Val Val Pro Thr Ala Asn Pro Val Lys Thr Asn Gly 
115 120 125 

Thr Leu Leu Arg Asn Gly Gly Leu Pro Gly Gly Pro Asn Lys He Pro 
130 135 140 

Asn Gly Asp He Cys Cys He Pro Asn Ser Asn Leu Asp Lys Ala Pro 
145 150 155 160 

Val Gin Leu Leu Met His Arg Pro Glu Lys Asp Arg Cys Pro Gin Ala 
155 170 175 

Gly Pro Arg Glu Arg Val Arg Phe Asn Glu Lys Val Gin Tyr His Gly 
180 185 190 

Tyr Cys Pro Asp Cys Asp Thr Arg Tyr Asn He Lys Asn Arg Glu Val 
195 200 205 

His Leu His Ser Glu Pro Val His Pro Pro Gly Lys lie Pro His Gin 
210 215 220 

Gly Pro Pro Leu Pro Pro Thr Pro His Leu Pro Pro Phe Pro Leu Glu 
225 230 235 240 

Asn Gly Gly Met Gly He Ser His Ser Asn Ser Phe Pro Pro He Arg 
245 250 255 

Pro Ala Thr Val Pro Pro Pro Thr Ala Pro Lys Pro Gin Lys Thr He 
260 265 270 

Leu Arg Lys Ser Thr Thr Thr Thr Val 
275 280 



<210> 13 
<211> 1498 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : cDNA derived 

from human mesenchymal stem cells after treatment 
with growth supplement 

<400> 13 

ggcacgagcg cgactcgggc tccggacccg ggcactgctg gcggctggag cggagcgcac 60 

cgcggcggtg gtgcccagag cggagcgcag ctccctgccc cgcccctccc cctcggcctc 12 0 

gcggcgacgg cggcggtggc ggcttggacg actcggagag ccgagtgaag acatttccac 18 0 

ctggacacct gaccatgtgc ctgccctgag cagcgaggcc caccaggcat ctctgttgtg 24 0 

ggcagcaggg ccaggtcctg gtctgtggac cctcggcagt tggcaggctc cctctgcagt 300 

ggggtctggg cctcggcccc accatgtcga gcctcggcgg tggctcccag gatgccggcg 350 

gcagtagcag cagcagcacc aatggcagcg gtggcagtgg cagcagtggc ccaaaggcag 42 0 

gagcagcaga caagagtgca gtggtggctg ccgccgcacc agcctcagtg gcagatgaca 480 

caccaccccc cgagcgtcgg aacaagagcg gtatcatcag tgagcccctc aacaagagcc 540 

tgcgccgctc ccgcccgctc tcccactact cttcttttgg cagcagtggt ggtagtggcg 600 

gtggcagcat gatgggcgga gagtctgctg acaaggccac tgcggctgca gccgctgcct 660 

ccctgttggc caatgggcat gacctggcgg cggccatggc ggtggacaaa agcaacccta 72 0 

cctcaaagca caaaagtggt gctgtggcca gcctgctgag caaggcagag cgggccacgg 78 0 

agctggcagc cgagggacag ctgacgctgc agcagtttgc gcagtccaca gagatgctga 84 0 

agcgcgtggt gcaggagcat ctcccgctga tgagcgaggc gggtgctggc ctgcctgaca 90 0 

tggaggctgt ggcaggtgcc gaagccctca atggccagtc cgacttcccc tacctgggcg 960 

ctttccccat caacccaggc ctcttcatta tgaccccggc aggtgtgttc ctggccgaga 1020 

gcgcgctgca catggcgggc ctggctgagt accccatgca gggagagctg gcctctgcca 1080 

tcagctccgg caagaagaag cggaaacgct gcggcatgtg cgcgccctgc cggcggcgca 114 0 

tcaactgcga gcagtgcagc agttgtagga atcgaaagac tggccatcag atttgcaaat 1200 

tcagaaaatg tgaggaactc aaaaagaagc cttccgctgc tctggagaag gtgatgcttc 1260 

cgacgggagc cgccttccgg tggtttcagt gacggcggcg gaacccaaag ctgccctctc 1320 

cgtgcaatgt cactgctcgt gtggtctcca gcaagggatt cgggcgaaga caaacggatg 13 8 0 

cacccgtctt tagaaccaaa aatattctct cacagatttc attcctgttt ttatatatat 144 0 

attttttgtt gtcgttttaa catctccacg tccctagcat aaaaaaaaaa aaaaaaaa 14 98 



<210> 14 
<2li> 361 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Putative 

protein derived from ORF of cDNA of SEQ ID NO: 13. 

<4 0 0> 14 

Met Ser Ser Leu Gly Gly Gly Ser Gin Asp Ala Gly Gly Ser Ser Ser 
15 10 15 

Ser Ser Thr Asn Gly Ser Gly Gly Ser Gly Ser Ser Gly Pro Lys Ala 
20 25 30 

Gly Ala Ala Asp Lys Ser Ala Val Val Ala Ala Ala Ala Pro Ala Ser 
35 40 45 

Val Ala Asp Asp Thr Pro Pro Pro Glu Arg Arg Asn Lys Ser Gly lie 
50 55 60 

lie Ser Glu- Pro Leu Asn Lys Ser Leu Arg Arg Ser Arg Pro Leu Ser 
65 70 75 80 

His Tyr Ser Ser Phe Gly Ser Ser Gly Gly Ser Gly Gly Gly Ser Met 
85 90 95 
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Met Gly Gly Giu Ser Ala Asp Lys 
100 

Ser Leu Leu Ala Asn Gly His Asp 
115 120 



Ala Thr Ala Ala Ala Ala Ala Ala 
105 no 

Leu Ala Ala Ala Met Ala Val Asp 
125 



Lys Ser Asn Pro Thr Ser Lys His 
130 135 

Leu Ser Lys Ala Glu Arg Ala Thr 
145 150 

Thr Leu Gin Gin Phe Ala Gin Ser 
165 

Gin Glu His Leu Pro Leu Met Ser 
180 



Lys Ser Gly Ala Val Ala Ser Leu 
14 0 

Glu Leu Ala Ala Glu Gly Gin Leu 
155 160 

Thr Glu Met Leu Lys Arg Val Val 
170 175 

Glu Ala Gly Ala Gly Leu Pro Asp 
185 190 



Met Glu Ala Val 
195 

Pro Tyr Leu Gly 
210 

Pro Ala Gly Val 
225 

Ala Glu Tyr Pro 



Lys Lys Lys Arg 
260 

lie Asn Cys Glu 
275 

Gin lie Cys Lys 
290 

Ala Ala Leu Glu 
3 05 

Phe Gin Lys Thr 



Ala Gly Ala Glu 
200 

Ala Phe Pro lie 
215 

Phe Leu Ala Glu 
230 

Met Gin Gly Glu 
245 

Lys Arg Cys Gly 



Gin Cys Ser Ser 
280 

Phe Arg Lys Cys 

295 

Lys Val Met Leu 
310 

Lys lie Leu Lys 
325 



Ala Leu Asn Gly 



Asn Pro Gly Leu 
220 

Ser Ala Leu His 
235 

Leu Ala Ser Ala 
250 

Met Cys Ala Pro 
265 

Cys Arg Asn Arg 



Glu Glu Leu Lys 
300 

Pro Thr Gly Ala 
315 

Glu Met Ser Gly 
330 



Gin Ser Asp Phe 
205 

Phe lie Met Thr 



Met Ala Gly Leu 
240 

lie Ser Ser Gly 
255 

Cys Arg Arg Arg 
270 

Lys Thr Gly His 
285 

Lys Lys Pro Ser 



Ala Phe Arg Trp 
320 

Gin Arg Gin Gly 
335 



Pro Gly His Val Ala Cys Pro Ala Ser Met Ser Leu Gly Gly Gly Val 
340 345 350 

Gly Trp Gly Trp Ala Ala Ala Ser Gin 
355 360 



<210> 15 
<211> 2329 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: cDNA derived 

from human mesenchymal stem cells after treatment 
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with growth, supplement 
<400> 15 

ggcacgagat gacattactc actatcagaa ttgagaaaat tggtttgaaa gatgctgggc 50 
agtgcatcga tccctatatt acagttagtg taaaggatct gaatggcata gacttaactc 120 
ctgtgcaaga tactcctgtg gcttcaagaa aagaagatac atatgttcat tttaatgtgg 180 
acattgagct ccagaagcat gttgaaaaat taaccaaagg tgcagctatc ttctttgaat 240 
tcaaacacta caagcctaaa aaaaggttta ccagcaccaa gtgttttgct ttcatggaga 300 
tggatgaaat taaacctggg ccaattgtaa tagaactata caagaaaccc actgacttta 3 60 
aaagaaagaa attgcaatta ttgaccaaga aaccacttta tcttcatcta catcaaactt 420 
tgcacaagga atgatcctga catgatgaac ctggaacttc tgtgaatttt accactcagt 480 
agaaaccatc atagctctgt gtagcatatt cacccttcaa caggcaggaa gcaagccgta 540 
cccagaccag taggccggac ggagtccaat gcaaagctgt accacagaat tcagagtcca 600 
gcacatcaca ctgacgtata ggactccttg ggatacaggt ttattgtaga ttttgaaaca 660 
tgtttttact tttctattaa ttgtgcaatt aatagtctat tttctaattt accactactc 72 0 
ctaccctgct tcctggaaca atactgttgt gggtaggatg tgctcatctt cagacttaat 78 0 
acagcaataa gaatgtgcta gagtttacac atctgttcac ttttgctcca atatgctctt 84 0 
ttgacttaac gtcaagcttt gggttgatgt gggtagggta gtgtcaaact gctttgagag 900 
gaatgggacc agttctgctg cctaagaagg tctgtctgga tgtttatagg cagcacctct 96 0 
gaagtggcct aaattcaccc tgatctgata gttttcctgc ttagaaagtg tgccttggcc 1020 
agatcagtat cccacatggg agtgttccct aggttgtagc tgtgattgtt tccagatgac 10 8 0 
cagattgttt ttctgaaaat gagcatattt ttagtcatgt cgattagctg ttcttctaca 114 0 
tcacattgtt actctttctg atgatgattc tagggttaac attggaacca tctcaaaata 12 0 0 
attacaaagt tttagatggg tttacaatgt cttctaaaca atgtaatcta aaaataattg 12 6 0 
agtcagatgc taacgagata ctgcaggcat aactgctgtt tttctgacaa ctgattgtga 1320 
aaccttaaaa cctgcatacc tcttcttaca gtgaggagta tgcaaaatct ggaaagatat 138 0 
tctatttttt ttatataggt agataggatc gccatttatt tcctatttag atatactgac 1440 
attcatccat atgaaaatat gcaggtcatt agcttactat aatttacttt tgacttaatg 1500 
gggcataaat aaaactttca tagtacacga ggtggatatt tgatacacag aacatttgcg 1560 
gtgggctttc tgtgggttag atgtaaagcc cacatatttt aatattcact attttaaatg 1620 
agcaatgcat gaggggaatg cagtgtcagt acctggccta tttttaaact agtgtaatca 168 0 
ccctagtcat accattcagt atgtttgctt tttaaaataa gtaaccacaa ttaagttgtt 174 0 
gtagcccttg cacttcaaga gatctagtct ttactttcag ttgtctgtta ggtccattct 18 00 
gtttactaga cggatgttaa taaaaactat gcgagcctga atgaattctc agccaaattt 18 60 
agtcttgtct ctcatcttga ttggattaat tccaaattct aaaatgattc agtccacaat 1920 
agctctaggg gatgaagaat ttgccttact ttgcccagtt cctaagactg tgagttgtca 198 0 
aatccctaga ctgtaagctc ttcaaggagc aagaggcgca ttttctccgt gtcatgtaat 204 0 
ttttctaagg tgcttggcag cactctgacc ctgtggagta ctcagacctt ttgtttgatg 210 0 
ttgctgacaa gacctgaaaa aaaatccctt aaaaaaaaaa acccattaaa gtgtagcaaa 216 0 
accgatgtat tattttttct cttttctcat aaaccataag atatgtatat caaaatgata 222 0 
atttatatga agcatctgct atccttcaca taagtatagt ttatatgaac tacaaatacc 2280 
tttcatcaac attaaagaag cttaaattga aaaaaaaaaa aaaaaaaaa 232 9 



<210> 16 
<211> 141 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Putative 

protein derived from ORF of cDNA of SEQ ID NO : 15 . 

<400> 16 

Met Thr Leu Leu Thr lie Arg lie Glu Lys lie Gly Leu Lys Asp Ala 
15 io 15 

Gly Gin Cys lie Asp Pro Tyr lie Thr Val Ser Val Lys Asp Leu Asn 
20 25 30 

Gly lie Asp Leu Thr Pro Val Gin Asp Thr Pro Val Ala Ser Arg Lys 
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Glu Asp Thr Tyr Val His Phe Asn Val Asp lie Glu Leu Gin Lys His 

50 55 60 

Val Glu Lys Leu Thr Lys Gly Ala Ala lie Phe Phe Glu Phe Lys His 

65 70 75 80 

Tyr Lys Pro Lys Lys Arg Phe Thr Ser Thr Lys Cys Phe Ala Phe Met 



Glu Met Asp Glu He Lys Pro Gly Pro He Val He Glu Leu Tyr Lys 
100 105 no 

Lys Pro Thr Asp Phe Lys Arg Lys Lys Leu Gin Leu Leu Thr Lys Lys 
115 120 125 

Pro Leu Tyr Leu His Leu His Gin Thr Leu His Lys Glu 
130 135 140 



<210> 17 
<211> 1737 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : cDNA derived 

from human mesenchymal stem cells after treatment 
with growth supplement 



<400> 17 

ggcacgagtc gacatggcgg ttaccctgag 
cgtcactcgc tgtgggttcg cgacccgggg 
gccggacccc gattccgact gggagccgga 
ccggcttctc cccgggccct cagcttgaag 
cgcccccttg gcgccgggtg tcctgccgct 
ccgttgttag gggcgcagcg tcaggtgttc 
gtgacgtatt tcatcatcag tttagagagt 
ggtgttagct ttctgggtgt acggagcagc 
tgcgatcaaa gagaagaggg ctgggcgctc 
cattctgctt ccacgcatgg cttctgccat 
cagaaacaag caatccgatt ccagaaaatt 
cccaggaccc tgacgtggga agccatggag 
gagtcctggt cagttcccag gttggctgaa 
agagttttaa aaagcaagtt tttacccaca 
gtccttaaga aagctgggct tgcccactcg 
tcaaagctgc tccctgcagg ccactctgta 
gcctcatcta aagacccaaa tcacagcaca 
aggacaaata caccaaggag aaggaaggga 
agctttgtgc ctgttgctgc acccctaggt 
gattctgaga gccccagagg aactggcagt 
gagttgaagg cagaggagcc agataacttc 
ttctttgaca gcaacgggaa cttcctgtac 
atgcctcgtg aaacacagct gggcaagtat 
atatggataa gccaccttgg aataggaaga 
gctgcgtgga tagattcaaa cttcctgtgg 
tcagtactca ctcttcttgc ttaggctctc 
tgttgttaca attttctgtg atacttgcaa 
cttctgacct catttccttc ttgatcagtg 
attggttttc cttacaacaa aataaagtaa 



tctcttgctg ggcgggcgcg tttgcgccgc 60 
ggtggcgggc ccaggcccta ttggccggga 120 
ggaacgggag ctgcaggagg tggagaggta 180 
cagggcctcg tgccccggcg ctccaggccg 240 
gcttgcgcag cggcccttgt ttcttcttac 3 00 
agctccccgg gaccactggt cccttcatta 360 
tcggcatgct tacaggcagt tattgttcta 42 0 
tctaagccgg caacatggcc cggttgccct 48 0 
catgatttag cctgaggctc ttcaaacatc 540 
tggttctctt cccccagcac cctgaaacga 600 
cggaggcaaa tggaggcgcc tggtgccccg 660 
cagatacggt atttacatga ggaatttcca 720 
ggctttgatg tcagcactga tgtgatccga 780 
ttggagcaga agctgaagca ggatcaaaaa 840 
ctgcagcacc tccggggctc tggaaatacc 900 
tcaggctctt tgcttatgcc agggcatgaa 960 
gctttgaaag tgatagagtc agacactcac 1020 
agaaataaag aaatccagga cctggaggag 108 0 
catccaagag agctgcagaa gtactccagt 114 0 
ggtgcgttgc caagtggtca gaagctggag 1200 
agcagcaaag tagtgcagag gggccgagag 1260 
agaatttgag tcggggcttg gcttatggag 132 0 
taatgtatat ggaacagcct ggatttctgc 138 0 
ggtgttgagc ctggactgtg ggaggaaaga 1440 
tagtgctccc agtctgacct ctgtagacct 1500 
tgtgtgttga aagccatccc gtgttgcatg 1560 
tttatgtttg agaagaagtg aaaagtttgc 1620 
aacactaaca ttttggggac aacttagtca 1680 
aatgtagcaa aaaaaaaaaa aaaaaaa 1737 
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<210> 18 
<211> 219 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Putative 

protein derived from ORF of cDNA of SEQ ID NO: 17. 

<400> 18 

Met Glu Ala Pro Gly Ala Pro Pro Arg Thr Leu Thr Trp Glu Ala Met 



Glu Gin lie Arg Tyr Leu His Glu Glu Phe Pro Glu Ser Trp Ser Val 

20 25 30 

Pro Arg Leu Ala Glu Gly Phe Asp Val Ser Thr Asp Val lie Arg Arg 

35 40 45 

Val Leu Lys Ser Lys Phe Leu Pro Thr Leu Glu Gin Lys Leu Lys Gin 

50 55 60 

Asp Gin Lys Val Leu Lys Lys Ala Gly Leu Ala His Ser Leu Gin His 

65 70 75 80 

Leu Arg Gly Ser Gly Asn Thr Ser Lys Leu Leu Pro Ala Gly His Ser 



Val Ser Gly Ser Leu Leu Met Pro Gly His Glu Ala Ser Ser Lys Asp 
100 105 110 

Pro Asn His Ser Thr Ala Leu Lys Val lie Glu Ser Asp Thr His Arg 
115 120 125 

Thr Asn Thr Pro Arg Arg Arg Lys Gly Arg Asn Lys Glu lie Gin Asp 
130 135 140 

Leu Glu Glu Ser Phe Val Pro Val Ala Ala Pro Leu Gly His Pro Arg 
145 150 155 160 

Glu Leu Gin Lys Tyr Ser Ser Asp Ser Glu Ser Pro Arg Gly Thr Gly 
165 170 175 

Ser Gly Ala Leu Pro Ser Gly Gin Lys Leu Glu Glu Leu Lys Ala Glu 
180 185 190 

Glu Pro Asp Asn Phe Ser Ser Lys Val Val Gin Arg Gly Arg Glu Phe 
195 200 205 

Phe Asp Ser Asn Gly Asn Phe Leu Tyr Arg lie 
210 215 



<210> 19 
<211> 369 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence : cDNA derived 

from human mesenchymal stem cells after treatment 
with growth supplement 

<400> 19 

cattaggtgt caccaaagtg ttaacaattc cttgggaatt tataaacaaa aaatatctag 6 0 
actaaaaata gaatataagg cccctctgga accactgcac acctttccct ctgttctcag 12 0 
attaaactgc tgcataaaat gaggttattt agcctgtttt taaggaaggc ttctattgca 18 0 
catgcatctc tttggttggt aataacatgt acttttgtta aaatatttcc ataatgaagc 240 
cctgttgtgt ggttagctgg gtgtggactt tcctcccttt cttgggggcc ctcctactca 3 00 
cagtcaagtt gccctttaga actaaagatc tggtaggatt gggtctttat tcaatagccc 3 60 
ctcgtgccg 3 69 



<210> 20 
<211> 48 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Putative 

protein derived from ORF of cDNA of SEQ ID NO: 19. 

<400> 20 

Glu Ser lie Arg Cys His Gin Ser Val Asn Asn Ser Leu Gly lie Tyr 



Lys Gin Lys lie Ser Arg Leu Lys lie Glu Tyr Lys Ala Pro Leu Glu 



Pro Leu His Thr Phe Pro Ser Val Leu Arg Leu Asn Cys Cys lie Lys 
35 40 45 



<210> 21 
<211> 2133 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : cDNA derived 

from human mesenchymal stem cells after treatment 
with growth supplement 

<400> 21 

cggcacgaga aatcatttaa tttaaggatt gcagcttaag acttggcata cggatacttg 60 
gatgacactg atttcttctg tagctgggca aatcagtttc ctcaccaaaa gtgaatatgg 120 
taaaattaca tgtttatatc atatgattgt gagactcaat aatagtttga gagctcagta 18 0 
ttatctttgt aggaaagaca taatttccaa atttgattag aatttgaaaa taaatcacat 24 0 
tttgatcatt gctagatact gtgtctaaat gtggaaaaac ctttgagaag gatttttgta 300 
tttttttttg actatttcaa gtcaatagag aaaagaattc tgttgaaatc agtgaagtgg 36 0 
tgataagtag tgggtggaag attacagctc agacttaatg ggatagattt ggaatattat 420 
agatgggtga atttattaca aagcaaatat taccccaaat tgtcctctaa gaaaatcctc 480 
ccccaatctg ttaatgaaaa aaaaattgtg gtacaatcct tataaaattt accattttag 540 
ccattttaaa gtcttaatgc cagttcagtg gcattaagta cattcatttt gctttgttac 600 
tatcactgcc atccatctcc agaaagcttt tcatcttgaa aatctgttac cgtacttatt 66 0 
taacaataag tccttatatc cctctctcct cagcccctgg caaccaccat cctactttct 72 0 
atctctgaat tcgactcttc tattaatagg tacctcatat aagtggaaca tagagtattt 78 0 
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gccctttttt gtgactggct tatttcactt agcataatgt gcccaagagt catatatgtt 84 0 

agcatgcgtc aggattttat ttcttttaaa ggctaaataa tgttccattg tatgtatata 90 0 

ccatattttg cttattcttt caccttccat ggacatttag gttgtttcta ccttgtaggt 960 

actgtgaatc atgcagccat aaacattggt gttcaaatat atgttccagc ttctgctttc 102 0 

aattcttttg catatatacc cagaagttaa attgctggat cacatagtca ttctattttt 1080 

aatttttttg aggaactgtc atattgtttt ctatagtggc tgcaccacca aaattttgaa 1140 

tggagagaat gtcagcatcc tgtttaataa attattttaa aaatcagctc tgagcaaatg 1200 

aatttagaat ttggaatatt tgaccaaaaa tcagttttac acgtacatag tatgttactg 1260 

ataaaggaat gactagagta gtacacctac agttcttgct ggccagagcc aggagggggt 1320 

gtaaaggaat ttcctgggtt taaaaaagca ggcagcattg atgaatcatg aagttccccc 13 8 0 

tacctccttt aatggttttg taacaaagag atgcattcag gtgaaagttt caagtctttt 144 0 

tgtaactata gaataactgt tctttttgta attgtagaat tctttaactt tttttaggct 1500 

gaagtagaga acatgaatca tttggtctga aacttttgag accctatttg gtttgcttac 156 0 

atatgtgaaa atagttatcc aaataattgc gattgctatc aaataatttc aataatatca 162 0 

gttacttctt ttggtctaga gtcttctcct ccttaacctc tttctttttg gcgagacagg 16 8 0 

gtctcactct gttacccagg ctggagtgca atggagtgat cactgcccct ttgaacttct 174 0 

gggctcaagt gatcctctca cctcagcttc ctgagtaggt gggactacag atgtgtgcca 1800 

ccaagcctgg ctaattaaaa acaaatgttt aagccaggca tggtggctca cacctgtaat 18 6 0 

ttcagcacct tgggaggccg aggcaggcgg atcacttgag gtcaggagtt caagaccagc 192 0 

ctggccaaga tggtggaacc ctgcctctac taaaaacaca cacacaaatt agccgggcat 198 0 

ggtggctggt gtctgtaatc ccagctgtgg gaggttgagg caggagaatc acttgaacct 2040 

gggaggtgga ggtttcagtg agccaagatc atttcacctc actccagcct gggtgacagc 210 0 

gtaagactct ctcaaaaaaa aaaaaaaaaa aaa 2133 



<210> 22 
<211> 56 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Putative 

protein derived from ORF of cDNA of SEQ ID NO: 21. 

<400> 22 

Met Phe His Cys Met Tyr lie Pro Tyr Phe Ala Tyr Ser Phe Thr Phe 
15 10 15 

His Gly His Leu Gly Cys Phe Tyr Leu Val Gly Thr Val Asn His Ala 
20 25 30 

Ala lie Asn He Gly Val Gin He Tyr Val Pro Ala Ser Ala Phe Asn 



Ser Phe Ala Tyr He Pro Arg Ser 
50 55 



<210> 23 
<211> 1200 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : cDNA derived 

from human mesenchymal stem cells after treatment 
with growth supplement 

<400> 23 

ggcacgaggt agcagcagca tggccgcgat ctatgggggt gtagaggggg gaggcacacg 6 0 
atccgaggtc cttttagtct cagaggatgg gaagatcctg gcagaagcag atggactgag 120 
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cacaaaccac tggctgatcg 
cagggccaaa cggaaagcag 
tctgagcggt ggggaccagg 
atttccctac ctgagtgaaa 
agctacaccg gatggtggag 
caaccctgat ggctccgaga 
ttcagcctac tggatcgcac 
agaggcggct cctcatgata 
gccagatcgg ctagggatac 
tgggttttgc cggaaaattg 
cttcaggaag gctggggaga 
cccggtcttg ttccagggca 
gagctgggag ctgctgaagg 
ggctcagaac ttcttctcca 
tggggccagc ctaggggcca 
tgccattgcc ttctattcct 
caagctcagt ggacactggg 
caaaaacaaa catagaagaa 



ggacagacaa gtgtgtggag 
gggtggatcc tctggtaccg 
aggacgcggg gaggatcctg 
gctacttaat caccaccgat 
ttgtgctcat atctggaaca 
gtggctgcgg cggctggggc 
accaagcagt gaaaatagtg 
tcggctacgt caaacaggcc 
tcactcacct gtatagggac 
cagaaggtgc tcagcaggga 
tgctgggcag acacatcgta 
agattggact ccccatcctg 
aaggttttct tttggcgctg 
gcttcaccct gatgaagctg 
ggcacatcgg gcacctcctc 
acaccttttc ctagggggct 
tctgaaagga aggagtcttt 
aataaatgca ctttatccaa 



aggatcaatg agatggtgaa 18 0 
ctgcgaagct tgggcctatc 240 
atcgaggagc tgagggaccg 3 00 
gccgccggct ccatcgccac 3 60 
ggctccaact gcaggctcat 420 
catatgatgg gtgatgaggg 48 0 
tttgactcca ttgacaacct 540 
atgttccact atttccaggt 600 
tttgataaat gcaggtttgc 660 
gacccccttt cccgctatat 720 
gcagtgttgc ccgagattga 780 
tgcgtgggct ctgtatggaa 84 0 
acccagggca gagagatcca 900 
aggcactcct ccgctctggg 960 
cccatggact atagcgccaa 1020 
ggtcccggct ccaccccctc 1080 
tgcttccttt ctccttttta 1140 
aaaaaaaaaa aaaaaaaaaa 12 00 



<210> 24 
<211> 344 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Putative 

protein derived from ORF of cDNA of SEQ ID NO: 23. 

<400> 24 

Met Ala Ala lie Tyr Gly Gly Val Glu Gly Gly Gly Thr Arg Ser Glu 
15 10 15 

Val Leu Leu Val Ser Glu Asp Gly Lys lie Leu Ala Glu Ala Asp Gly 
20 25 30 

Leu Ser Thr Asn His Trp Leu lie Gly Thr Asp Lys Cys Val Glu Arg 
35 40 45 

lie Asn Glu Met Val Asn Arg Ala Lys Arg Lys Ala Gly Val Asp Pro 
50 55 60 

Leu Val Pro Leu Arg Ser Leu Gly Leu Ser Leu Ser Gly Gly Asp Gin 
65 70 75 80 

Glu Asp Ala Gly Arg lie Leu lie Glu Glu Leu Arg Asp Arg Phe Pro 
85 90 95 

Tyr Leu Ser Glu Ser Tyr Leu lie Thr Thr Asp Ala Ala Gly Ser lie 
100 105 110 

Ala Thr Ala Thr Pro Asp Gly Gly Val Val Leu lie Ser Gly Thr Gly 

115 120 125 

Ser Asn Cys Arg Leu lie Asn Pro Asp Gly Ser Glu Ser Gly Cys Gly 
130 135 140 

Gly Trp Gly His Met Met Gly Asp Glu Gly Ser Ala Tyr Trp lie Ala 
145 150 155 160 

His Gin Ala Val Lys lie Val Phe Asp Ser lie Asp Asn Leu Glu Ala 
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165 170 175 

Ala Pro His Asp lie Gly Tyr Val Lys Gin Ala Met Phe His Tyr Phe 
180 185 190 

Gin Val Pro Asp Arg Leu Gly lie Leu Thr His Leu Tyr Arg Asp Phe 
195 200 205 

Asp Lys Cys Arg Phe Ala Gly Phe Cys Arg Lys lie Ala Glu Gly Ala 
210 215 220 

Gin Gin Gly Asp Pro Leu Ser Arg Tyr lie Phe Arg Lys Ala Gly Glu 
225 230 235 240 

Met Leu Gly Arg His lie Val Ala Val Leu Pro Glu lie Asp Pro Val 
245 250 255 

Leu Phe Gin Gly Lys lie Gly Leu Pro lie Leu Cys Val Gly Ser Val 
260 265 270 

Trp Lys Ser Trp Glu Leu Leu Lys Glu Gly Phe Leu Leu Ala Leu Thr 
275 280 285 

Gin Gly Arg Glu lie Gin Ala Gin Asn Phe Phe Ser Ser Phe Thr Leu 
290 295 300 

Met Lys Leu Arg His Ser Ser Ala Leu Gly Gly Ala Ser Leu Gly Ala 
305 310 315 320 

Arg His lie Gly His Leu Leu Pro Met Asp Tyr Ser Ala Asn Ala lie 
325 330 335 

Ala Phe Tyr Ser Tyr Thr Phe Ser 
340 



<210> 25 
<211> 2216 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : cDNA derived 

from human mesenchymal stem cells after treatment 
with growth supplement 

<400> 25 

ggcacgaggg gcaccgtaac cagcgccgcg gacaccggca ccggcgccac ggactccgca 60 
ggaccccgcg cccgccgccg ccgctatgct ggggctgctg gtggcgttgc tggccctggg 120 
gctcgctgtc tttgcgctgc tggacgtctg gtacctggtg cgccttccgt gcgccgtgct 18 0 
gcgcgcgcgc ctgctgcagc cgcgcgtccg tgacctgcta gctgagcagc gcttcccggg 240 
ccgcgtgctg ccctcggact tggacctgct gttgcacatg aacaacgcgc gctacctgcg 3 00 
cgaggccgac tttgcgcgcg tcgcgcacct gacccgctgc ggggtgctcg gggcgctgag 360 
ggagttgcgg gcgcacacgg tgctggcggc ctcgtgcgcg cgccaccgcc gctcgctgcg 420 
cctgctggag cccttcgagg tgcgcacccg cctgctgggc tgggacgacc gcgcgttcta 480 
cctggaggcg cgctttgtca gcctgcggga cggtttcgtg tgcgcgctgc tgcgcttccg 540 
gcagcacctg ctgggcacct cacccgagcg cgtcgtgcag cacctgtgcc agcgcagggt 6 00 
ggagccccct gagctgcccg ctgatctgca gcactggatc tcctacaacg aggccagcag 660 
ccagctgctc cgcatggaga gtgggctcag tgatgtcacc aaggaccagt gaccgccacc 720 
ttcacaccgt ctgccctggc caccatcctg ggcctggggg ctgcccacag atgggcagtc 780 
tcagccatac tctgttccag ctggagtagc ctcctgacca gcctggccca ccctgctcca 840 
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cccactgggc ccccccagtt attgataccc 
gaggagtggc attggcatcc tgacccagct 
ggagagtcct gcctggccct acgatgaggc 
gtgcctggag cagagggacc cacaagtgcc 
gcaagggccg aggcagggcc agaccagagc 
cctgggcctg attcaggtgc agtgggcact 
agcccctctg gcttccgctg aggtggttgc 
gccggggcta gaggcagatg gaagggaggc 
tccccactgc agtcggcctt gcctcctcct 
agggtagcct tgtggctgag aggggtcaga 
gcaggcttcc tgctgggcgt tcccttgttg 
ggggtgtcct tgtccactcg cacccctcgc 
gctgatcccc accctttctg ggcctacagc 
cccctctctg gcctggccgg caggcgtctt 
gttccttgtc agctgctggg ccggccctgc 
tctcaggtct ccctaatgga gacaccaggc 
ctgcctaatg cctcacctta cagctgggga 
aaggcaaagt tggggctggt cccagcctga 
agggggtgcc ccatctagct gggtgcaggg 
ccgcttgcct gttcccctac atctgtgcct 
ctccaggaag cccaccttga gccagagtca 
ccgccctcac tgctgttctt gccttacagc 
tatttaaaga aagcccaaca ttaaagggtt 



ctctgtgctg ggctccacgc taggcagaag 900 
ctgccctcaa ggtggggatg gatgggcaaa 960 
cactcatgtg ggcctaggta ggggaggatg 1020 
tcccgagcct agatcctggc tcggaccact 1080 
atcctgggta caggcctggg ctctccaggg 114 0 
cctgaagggt cagagcggca tctgccaggc 1200 
aggcctgggg cagagcctgg gtggtcagag 12 60 
atttgctgac agaggacggg gcacccgggc 1320 
cctcctctac ctccagtcag gctggacggg 13 80 
ctaggtggca caggggctcc tggaaagaca 1440 
gagggaatag agtgggggtg ggactctgca 1500 
cgcccaccag ggccatgctc tgtgacttgg 1560 
accacaggcc gctgtacccc cttagagctg 162 0 
cttaactcct ctgtcctcta tattcagcat 1680 
cttgcgctag cagagcctct cctggcagct 1740 
tactaggaca ctggctgggg ccaccccctc 1800 
aactgaggcc tggaatggcc cagagtcacc 18 6 0 
ggctccagct gatgccctca gctcccagag 1920 
gtcactgctt gtcagctcag ggccctgtgc 198 0 
gcacatccag aactgcctcc ttgccgctgc 204 0 
agggctgcag cactgcccga tagaacacgc 210 0 
caccatggga aagctgcaac ctttctgttt 216 0 
ttcattgcaa aaaaaaaaaa aaaaaa 2216 



<210> 26 
<211> 208 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Putative 

protein derived from ORF of cDNA of SEQ ID NO: 25. 

<400> 26 

Met Leu Gly Leu Leu Val Ala Leu Leu Ala Leu Gly Leu Ala Val Phe 
15 10 15 

Ala Leu Leu Asp Val Trp Tyr Leu Val Arg Leu Pro Cys Ala Val Leu 
20 25 30 

Arg Ala Arg Leu Leu Gin Pro Arg Val Arg Asp Leu Leu Ala Glu Gin 
35 40 45 



Arg Phe Pro Gly Arg Val Leu Pro 
50 55 

Met Asn Asn Ala Arg Tyr Leu Arg 
65 70 

His Leu Thr Arg Cys Gly Val Leu 
85 

His Thr Val Leu Ala Ala Ser Cys 
100 

Leu Leu Glu Pro Phe Glu Val Arg 
115 120 

Arg Ala Phe Tyr Leu Glu Ala Arg 
130 135 



Ser Asp Leu Asp Leu Leu Leu His 
60 

Glu Ala Asp Phe Ala Arg Val Ala 
75 80 

Gly Ala Leu Arg Glu Leu Arg Ala 
90 95 

Ala Arg His Arg Arg Ser Leu Arg 
105 110 

Thr Arg Leu Leu Gly Trp Asp Asp 
125 

Phe Val Ser Leu Arg Asp Gly Phe 
140 
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Val Cys Ala Leu 
145 

Glu Arg Val Val 



Leu Pro Ala Asp 
180 

Gin Leu Leu Arg 
195 



Leu Arg Phe Arg 
150 

Gin His Leu Cys 
165 

Leu Gin His Trp 



Met Glu Ser Gly 
200 



Gin His Leu Leu 
155 

Gin Arg Arg Val 
170 

lie Ser Tyr Asn 
185 

Leu Ser Asp Val 



Gly Thr Ser Pro 
160 

Glu Pro Pro Glu 
175 

Glu Ala Ser Ser 
190 

Thr Lys Asp Gin 
205 



<210> 27 
<211> 2453 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Consensus 

sequence of the coding strand of cDNAs from human 
mesenchymal stem cell genes . 

<400> 27 

gaattcggca cgaggtcgcg gcggcgaagg aggaggagga agagggcgag gcgacaagag 60 
aagaaggagg caggcgcggc ggcagcggcg gcgccccgag ccggcggagg cgaggggggg 120 
gaagatggcg gacgtgctta gcgtcctgcg acagtacaac atccagaaga aggagattgt 180 
ggtgaaggga gacgaagtga tcttcgggga gttctcctgg cccaagaatg tgaagaccaa 240 
ctatgttgtt tgggggactg gaaaggaagg ccaacccaga gagtactaca cattggattc 3 00 
cattttattt ctacttaata acgtgcacct ttctcatcct gtttatgtcc gacgtgcagc 3 60 
tactgaaaat attcctgtgg ttagaagacc tgatcgaaaa gatctacttg gatatctcaa 42 0 
tggtgaagcg tcaacatcgg caagtataga cagaagcgct cccttagaaa taggtcttca 48 0 
gcgatctact caagtcaaac gagctgcaga tgaagtttta gcagaagcaa agaaaccacg 54 0 
aattgaggat gaagagtgtg tgcgccttga taaagagaga ttggctgccc gtttggaggg 600 
tcacaaagaa gggattgtac agactgaaca gattaggtct ttgtctgaag ctatgtcagt 66 0 
ggaaaaaatt gctgcaatca aagccaaaat tatggctaag aaaagatcta ctatcaagac 720 
tgatctagat gatgacataa ctgcccttaa acagaggagt tttgtggatg ctgaggtaga 780 
tgtgacccga gatattgtca gcagagagag agtatggagg acacgaacaa ctatcttaca 840 
aagcacagga aagaattttt ccaagaacat ttttgcaatt cttcaatctg taaaagccag 9 00 
agaagaaggg cgtgcacctg aacagcgacc tgccccaaat gcagcacctg tggatcccac 960 
tttgcgcacc aaacagccta tcccagctgc ctataacaga tacgatcagg aaagattcaa 1020 
aggaaaagaa gaaacggaag gcttcaaaat tgacactatg ggaacctacc atggtatgac 1080 
actgaaatct gtaacggagg gtgcatctgc ccggaagact cagactcctg cagcccagcc 1140 
agtaccaaga ccagtttctc aagcaagacc tcccccaaat cagaagaaag gatctcgaac 1200 
acccattatc ataattcctg cagctaccac ctctttaata accatgctta atgcaaaaga 1260 
ccttctacag gacctgaaat ttgtcccatc agatgaaaag aagaaacaag gttgtcaacg 1320 
agaaaatgaa actctaatac aaagaagaaa agaccagatg caaccagggg gcactgcaat 1380 
tagtgttaca gtaccttata gagtagtaga ccagcccctt aaacttatgc ctcaagactg 1440 
ggaccgcgtt gtagccgttt ttgtgcaggg tcctgcatgg cagttcaaag gttggccatg 1500 
gcttttgcct gatggatcac cagttgatat atttgctaaa attaaagcct tccatctgaa 1560 
gtatgatgaa gttcgtctgg atccaaatgt tcagaaatgg gatgtaacag tattagaact 1620 
cagctatcac aaacgtcatt tggatagacc agtgttctta cggttttggg aaacattgga 1680 
caggtacatg gtaaagcata aatcgcactt gagattctga attatttggc tcctccattt 174 0 
ctggaaattg agactcaagc tttatgaatt tatcaagaac ttaaaaatga agaaggtcac 18 00 
agattgatct tttataagac cttatttgat gctttgtgct tcaaggagat gatacctgtc 1860 
atccatataa gcaaactttt tggcttacaa ctattttttt aatattagcc ttctagtctg 1920 
taatggaaat tgtatatttt gatagaagtt ttttctccat tggttaaatt agcattactt 1980 
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aaaatttgtt tctttagaaa ataaatgcag gttataaatg tgtgtatatt tagagattat 2 04 0 

aaggctctct gagccatctt ctgatttttc attgctctat aattcttttt actgaaaata 2100 

ctatgttatg aatggtatta aattttagtc tctggaacat ccaaaaccaa gcaaagggat 216 0 

gtgactattt tgaatgaatc agaatgtcaa cttgtatgta cactatatct acacttactc 2220 

attatttaaa aagaataatg aaaaatctag atcaattctt caatttgatt gaactgttca 2280 

gccttttcaa gatttcttta tttacaaatg attacattta aatgaatgta cattcttctc 2340 

actgactttg gtgattttga aacctagaat gatgtgtttc tatctgtaat atctttccat 24 00 

ttgaaaaaaa tctcaaaaca cagattaaaa ccacaaaaaa aaaaaaaaaa aaa 24 53 



<210> 28 
<211> 1593 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Coding sequence 
for the protein of SEQ ID NO; 29 and corresponding 
to residues 125-1717 of SEQ ID NO: 27 

<400> 28 

atggcggacg tgcttagcgt cctgcgacag tacaacatcc agaagaagga gattgtggtg 6 0 

aagggagacg aagtgatctt cggggagttc tcctggccca agaatgtgaa gaccaactat 12 0 

gttgtttggg ggactggaaa ggaaggccaa cccagagagt actacacatt ggattccatt 180 

ttatttctac ttaataacgt gcacctttct catcctgttt atgtccgacg tgcagctact 24 0 

gaaaatattc ctgtggttag aagacctgat cgaaaagatc tacttggata tctcaatggt 3 00 

gaagcgtcaa catcggcaag tatagacaga agcgctccct tagaaatagg tcttcagcga 3 60 

tctactcaag tcaaacgagc tgcagatgaa gttttagcag aagcaaagaa accacgaatt 42 0 

gaggatgaag agtgtgtgcg ccttgataaa gagagattgg ctgcccgttt ggagggtcac 48 0 

ciaagaaggga ttgtacagac tgaacagatt aggtctttgt ctgaagctat gtcagtggaa 54 0 

aaaattgctg caatcaaagc caaaattatg gctaagaaaa gatctactat caagactgat 60 0 

ctagatgatg acataactgc ccttaaacag aggagttttg tggatgctga ggtagatgtg 66 0 

cLCCcgagata ttgtcagcag agagagagta tggaggacac gaacaactat cttacaaagc 72 0 

acaggaaaga atttttccaa gaacattttt gcaattcttc aatctgtaaa agccagagaa 780 

gaagggcgtg cacctgaaca gcgacctgcc ccaaatgcag cacctgtgga tcccactttg 84 0 

cgcaccaaac agcctatccc agctgcctat aacagatacg atcaggaaag attcaaagga 900 

aaagaagaaa cggaaggctt caaaattgac actatgggaa cctaccatgg tatgacactg 960 

aaatctgtaa cggagggtgc atctgcccgg aagactcaga ctcctgcagc ccagccagta 102 0 

ccaagaccag tttctcaagc aagacctccc ccaaatcaga agaaaggatc tcgaacaccc 1080 

attatcataa ttcctgcagc taccacctct ttaataacca tgcttaatgc aaaagacctt 1140 

ctacaggacc tgaaatttgt cccatcagat gaaaagaaga aacaaggttg tcaacgagaa 12 00 

ciatgaaactc taatacaaag aagaaaagac cagatgcaac cagggggcac tgcaattagt 12 60 

gttacagtac cttatagagt agtagaccag ccccttaaac ttatgcctca agactgggac 1320 

cgcgttgtag ccgtttttgt gcagggtcct gcatggcagt tcaaaggttg gccatggctt 13 80 

ttgcctgatg gatcaccagt tgatatattt gctaaaatta aagccttcca tctgaagtat 1440 

gatgaagttc gtctggatcc aaatgttcag aaatgggatg taacagtatt agaactcagc 1500 

tatcacaaac gtcatttgga tagaccagtg ttcttacggt tttgggaaac attggacagg 1560 

tacatggtaa agcataaatc gcacttgaga ttc 1593 



<210> 29 
<211> 531 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Amino acid 

sequence derived from the nucleotide sequence of 
SEQ ID NO: 28 

<400> 29 
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Met Ala Asp Val Leu Ser Val Leu Arg Gin Tyr Asn lie Gin Lys Lys 
15 10 15 

Glu lie Val Val Lys Gly Asp Glu Val lie Phe Gly Glu Phe Ser Trp 
20 25 30 

Pro Lys Asn Val Lys Thr Asn Tyr Val Val Trp Gly Thr Gly Lys Glu 
35 40 45 

Gly Gin Pro Arg Glu Tyr Tyr Thr Leu Asp Ser He Leu Phe Leu Leu 
50 55 60 

Asn Asn Val His Leu Ser His Pro Val Tyr Val Arg Arg Ala Ala Thr 
65 70 75 80 

Glu Asn He Pro Val Val Arg Arg Pro Asp Arg Lys Asp Leu Leu Gly 
85 90 95 

Tyr Leu Asn Gly Glu Ala Ser Thr Ser Ala Ser He Asp Arg Ser Ala 
100 105 110 

Pro Leu Glu He Gly Leu Gin Arg Ser Thr Gin Val Lys Arg Ala Ala 
115 120 125 

Asp Glu Val Leu Ala Glu Ala Lys Lys Pro Arg lie Glu Asp Glu Glu 
130 135 140 

Cys Val Arg Leu Asp Lys Glu Arg Leu Ala Ala Arg Leu Glu Gly His 
145 150 155 160 

Lys Glu Gly He Val Gin Thr Glu Gin He Arg Ser Leu Ser Glu Ala 
165 170 175 

Met Ser Val Glu Lys He Ala Ala He Lys Ala Lys He Met Ala Lys 
180 185 190 

Lys Arg Ser Thr He Lys Thr Asp Leu Asp Asp Asp He Thr Ala Leu 
195 200 205 

Lys Gin Arg Ser Phe Val Asp Ala Glu Val Asp Val Thr Arg Asp He 
210 215 220 

Val Ser Arg Glu Arg Val Trp Arg Thr Arg Thr Thr He Leu Gin Ser 
22 5 230 235 240 

Thr Gly Lys Asn Phe Ser Lys Asn He Phe Ala He Leu Gin Ser Val 
245 250 255 

Lys Ala Arg Glu Glu Gly Arg Ala Pro Glu Gin Arg Pro Ala Pro Asn 
260 2S5 270 

Ala Ala Pro Val Asp Pro Thr Leu Arg Thr Lys Gin Pro He Pro Ala 
275 280 285 

Ala Tyr Asn Arg Tyr Asp Gin Glu Arg Phe Lys Gly Lys Glu Glu Thr 
290 295 300 

Glu Gly Phe Lys He Asp Thr Met Gly Thr Tyr His Gly Met Thr Leu 
305 310 315 320 

Lys Ser Val Thr Glu Gly Ala Ser Ala Arg Lys Thr Gin Thr Pro Ala 
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Ala Gin Pro Val Pro Arg Pro Val Ser Gin Ala Arg Pro Pro Pro Asn 
340 345 350 

Gin Lys Lys Gly Ser Arg Thr Pro lie lie lie He Pro Ala Ala Thr 
355 360 365 

Thr Ser Leu lie Thr Met Leu Asn Ala Lys Asp Leu Leu Gin Asp Leu 
370 375 380 

Lys Phe Val Pro Ser Asp Glu Lys Lys Lys Gin Gly Cys Gin Arg Glu 
385 390 395 400 

Asn Glu Thr Leu He Gin Arg Arg Lys Asp Gin Met Gin Pro Gly Gly 
405 410 415 

Thr Ala He Ser Val Thr Val Pro Tyr Arg Val Val Asp Gin Pro Leu 
420 425 430 

Lys Leu Met Pro Gin Asp Trp Asp Arg Val Val Ala Val Phe Val Gin 
435 440 445 

Gly Pro Ala Trp Gin Phe Lys Gly Trp Pro Trp Leu Leu Pro Asp Gly 
450 455 460 

Ser Pro Val Asp lie Phe Ala Lys He Lys Ala Phe His Leu Lys Tyr 
465 470 475 480 

Asp Glu Val Arg Leu Asp Pro Asn Val Gin Lys Trp Asp Val Thr Val 
485 490 495 

Leu Glu Leu Ser Tyr His Lys Arg His Leu Asp Arg Pro Val Phe Leu 
500 505 510 

Arg Phe Trp Glu Thr Leu Asp Arg Tyr Met Val Lys His Lys Ser His 
515 520 525 

Leu Arg Phe 
530 
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What is Claimed Is: 

1 . An isolated nucieic acid comprising a polynucleotide that is 
at least 90% identical to a polynucleotide encoding a polypeptide 

5 comprising an amino acid sequence selected from the group consisting of 
SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, and 29. 

2. An isolated nucleic acid comprising a polynucleotide that is 
at least 95% identical to a polynucleotide encoding a polypeptide 

10 comprising an amino acid sequence selected from the group consisting of 
SEQ ID NOS: 2, 4, 6, 8, 1 0, 1 2, 1 4, 1 6, 1 8, 20, 22, 24, 26, and 29. 

3. An isolated nucleic acid comprising a polynucleotide that is' 
15 at least 98% identical to a polynucleotide encoding a polypeptide 

comprising an amino acid sequence selected from the group consisting of 
SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, and 29. 

20 4. An isolated nucleic acid comprising RNA corresponding to 

any of the DNA sequences or fragments of claims 1 , 2 or 3. 

5. An isolated nucleic acid comprising a DNA sequence identical 
to a sequence selected from the group consisting of SEQ ID NOS: 1 , 3, 5, 

25 7, 9, 1 1 , 1 3, 1 5, 1 7, 1 9, 21 , 23, 25, 27 and 28 and the complements of 
these. 

6. An isolated nucleic acid comprising RNA corresponding to 
the DNA sequence of Claim 5. 

30 



40 



WO 00/59933 



PCT/US00/08751 



7. An isolated nucleic acid comprising at least the polypeptide 
coding region of a human gene, said human gene containing a DNA 
sequence according to Claim 1 . 

5 8. An isolated nucleic acid comprising at least the polypeptide 

coding region of a human gene which contains the DNA sequence of 
Claim 5. 

9. The isolated nucleic acid of claim 8 which expresses a 
10 human protein when in a suitable expression system. 

1 0. A vector comprising the DNA sequence of claim 1 . 

11. A vector comprising the DNA sequence of claim 3. 

15 

1 2. A vector comprising the DNA sequence of claim 5. 

13. A vector comprising the DNA sequence of claim 9. 

20 14. A polypeptide coded for by the DNA sequence of claim 7 and 

active fragments, derivatives and functional analogs thereof. 

15. A polypeptide coded for by the DNA sequence of claim 8 and 
active fragments, derivatives and functional analogs thereof. 

25 

16. A polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 
20, 22, 24, 26, and 29. 

3 0 17. A genetically engineered cell having inserted into the genome 

thereof the DNA of Claim 7. 



41 



WO 00/59933 



PCT/USOO/08751 



18. A process for producing ceiis for expressing a polypeptide 
using genetically engineered cells of claim 17. 

19. An isolated DNA sequence comprising a fragment of a DNA 
5 of claim 5, wherein said fragment comprises at least 1 5 sequential bases 

of said sequence. 

20. An isolated DNA sequence comprising a fragment of DNA of 
claim 5, wherein said fragment comprises at least 30 sequential bases of 

10 said sequence. 

21 . An isolated DNA sequence comprising a fragment of DNA of 
claim 5, wherein said fragment comprises at least 50 sequential bases of 
said sequence. 

15 

22. An isolated DNA sequence comprising a fragment of DNA of 
claim 5, wherein said fragment comprises at least 80 sequential bases of 
said sequence. 

2 0 23. A method of detecting genes within the human genome 

comprising contacting a sample of said genome with an isolated DNA 
selected from the group consisting of the DNAs of claims 19, 20, 21, and 
22. 

25 24. A monoclonal antibody against a polypeptide selected from the 

group consisting of the polypeptides of claims 14 and 15. 
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Figure 1 



GAATTCGGCA CGAGGTCGCG GCGGCGAAGG AGGAGGAGGA AGAGGGCGAG GCGACAAGAG SO 

AAGAAGGAGG CAGGCGCGGC GGCAGCGGCG GCGCCCCGAG CCGGCGGAGG CGAGGGGGGG 12 0 

GAAGATGGCG GACGTGCTTA GCGTCCTGCG ACAGTACAAC AT C C AGAAGA AGGAGATTGT 180 

GGTGAAGGGA GACGAAGTGA TCTTCGGGGA GTTCTCCTGG CCCAAGAATG TGAAGACCAA 240 

CTATGTTGTT TGGGGGACTG GAAAGGAAGG CCAACCCAGA GAGTACTACA CATTGGATTC 3 00 

CATTTTATTT CTACTTAATA ACGTGCACCT TTCTCATCCT GTTTATGTCC GACGTGCAGC 3 60 

TACTGAAAAT ATTCCTGTGG TTAGAAGACC TGATCGAAAA GAT CTACTTG GATATCTCAA 420 

TGGTGAAGCG TCAACATCGG CAAGTATAGA CAGAAGCGCT CCCTTAGAAA TAGGTCTTCA 4 80 

GCGATCTACT CAAGTCAAAC GAGCTGCAGA TGAAGTTTTA GCAGAAGCAA AGAAACCACG 540 

AATTGAGGAT GAAGAGTGTG TGCGC CTTGA TAAAGAGAGA TTGGCTGCCC GTTTGGAGGG 6 00 

TCACAAAGAA GGGATTGTAC AGACTGAACA GATTAGGTCT TTGTCTGAAG CTATGTCAGT 660 

GGAAAAAATT GCTGCAATCA AAGCCAAAAT TATGGCTAAG AAAAGATCTA CTATCAAGAC 720 

TGAT CTAGAT GATGACATAA CTGCCCTTAA ACAGAGGAGT TTTGTGGATG CTGAGGTAGA 780 

TGTGACCCGA GATATTGTCA GCAGAGAGAG AGTATGGAGG ACACGAACAA CTATCTTACA 84 0 

.AAGCACAGGA AAGAATTTTT CCAAGAACAT TTTTGCAATT CTTCAATCTG TAAAAGCCAG 9 00 

AGAAGAAGGG CGTGCACCTG AACAGCGACC TGCCCCAAAT GCAGCACCTG TGGATC CCAC 96 0 

TTTGCGCACC AAACAGCCTA TCCCAGCTGC CTATAACAGA TACGATCAGG AAAGATTCAA 1020 

AGGAAAAGAA GAAACGGAAG GCTTCAAAAT TGACACTATG GGAACCTACC ATGGTATGAC 1080 

ACTGAAATCT GTAACGGAGG GTGCATCTGC CCGGAAGACT CAGACTCCTG CAGCCCAGCC 114 0 

AGTACCAAGA CCAGTTTCTC AAGCAAGACC TCCCCCAAAT CAGAAGAAAG GATCTCGAAC 12 00 

ACCCATTATC ATAATTCCTG CAGCTACCAC CTCTTTAATA ACCATGCTTA ATGCAAAAGA 1260 

CCTTCTACAG GACCTGAAAT TTGTCCCATC AGATGAAAAG AAGAAACAAG GTTGTCAACG 1320 

AGAAAATGAA ACTCTAATAC AAAGAAGAAA AGACCAGATG CAAC C AGGGG GCACTGCAAT 13 80 

TAGTGTTACA GTACCTTATA GAGTAGTAGA CCAGCCCCTT AAACTTATGC CTCAAGACTG 144 0 

GGACCGCGTT GTAGCCGTTT TTGTGCAGGG TCCTGCATGG CAGTTCAAAG GTTGGCCATG 1500 

GCTTTTGCCT GATGGATCAC CAGTTGATAT ATTTGCTAAA ATTAAAGCCT TCCATCTGAA 1560 

GTATGAT GAA GTTCGTCTGG AT C CAAATGT TCAGAAATGG GATGTAACAG TATTAGAACT 1620 

CAGCTATCAC AAACGTCATT TGGATAGACC AGTGTTCTTA CGGTTTTGGG AAACATTGGA 1680 

CAGGTACATG GTAAAGCATA AATCGCACTT GAGATTCTGA ATTATTTGGC TCCTCCATTT 1740 

CTGGAAATTG AGACTCAAGC TTTATGAATT TAT CAAGAAC TTAAAAATGA AGAAGGTCAC 1800 

AGATTGATCT TTTATAAGAC CTTATTTGAT GCTTTGTGCT TCAAGGAGAT GATACCTGTC 1860 

ATCCATATAA GCAAACTTTT TGGCTTACAA CTATTTTTTT AATATTAGCC TTCTAGTCTG 1920 

TAATGGAAAT TGTATATTTT GATAGAAGTT TTTTCTCCAT TGGTTAAATT AGCATTACTT 1980 

AAAATTTGTT TCTTTAGAAA ATAAATGCAG GTTATAAATG TGTGTATATT TAGAGATTAT 2 040 

AAGGCTCTCT GAGCCATCTT CTGATTTTTC ATTGCTCTAT AATTCTTTTT ACTGAAAATA 2100 

CTATGTTATG AATGGTATTA AATTTTAGTC TCTGGAACAT CCAAAACCAA GCAAAGGGAT 2160 

GTGACTATTT TGAATGAATC AGAATGTCAA CTTGTATGTA CACTATATGT AC ACTTACT C 2220 

ATTATTTAAA AAGAATAATG AAAAATCTAG AT CAATT CTT CAATTTGATT GAACTGTTCA 2280 

GCCTTTTCAA GATTTCTTTA TTTACAAATG ATTACATTTA AATGAATGTA CATTCTTCTC 234 0 

ACTGACTTTG GTGATTTTGA AACCTAGAAT GATGTGTTTC TATCTGTAAT ATCTTTCCAT 24 00 

TTGAAAAAAA TCTCAAAACA CAGATTAAAA CCACAAAAAA AAAAAAAAAA AAA 2453 
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Figure 2 



MADVLSVLRQYNIQKKEIWKGDEVIFGEFSWPKNVKTNYWWGTGKEGQPREYYTLDSI 
LFLLNNVHLSHPVYV*RRAATENIPWRRPDRKD*LLGYLNGEASTSASIDRSAPLEIGL 
QRSTQVKRAADEVLAEAKKPRIEDEECVRLDKERLAARLEGHKEGIVQTEQIRSLSEAMS 
VEKIAAIKAKIMAKKRSTIKTDLDDDITALKQRSFVDAEVDVTRDIVSRERVWRTRTTIL 
QSTGKNFSKNIFAILQSVKAREEGRAPEQRPAPNAAPVDPTLRTKQPIPAAYNRYDQERF 
KGKEETEGFKIDTMGTYHGMTLKSVTEGASARKTQTPAAQPVPRPVSQARPPPNQKKGSR 
TPIIIIPAATTSLITMLNAKDLLQDLKFVPSDEK*KKQGCQRENETLIQRRKD*QMQPGG 
TAISVTVPYRVVDQPLIOjMPQDWDRVVAVFVQGPAWQFKGWPWLLPDGSPVDIFAKIKAF 
HLETYDEVRLDPNVQKWDVTVLELSYHKRHLDRPVFLRFWETLDRYMVKHKSHLRF 
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Figure 3 



Human RNA Master Slot. 
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DSC64: Tissue Distribution 



50000 



100000 



< 

< 



heart 
prostate 
testis 
adrenal gland 
smal! intestine 
pituitary gland 
placenta 
liver 
stomach 
thyroid gland 
thymus 
appendix 
salivary gland 
fetal lung 
lung 
putamen 
spleen 
colon 
pancreas 
ovary 
kidney 
whole brain 
occipital lobe 
skeletal muscle 
spinal cord 
substantia nigra 
aorta 
lymph node 
fetal liver 
bone marrow 



i ■ 



4/5 



09/93797J 

PCT/US00/08751 

Figure 4(B) 
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COMBINED DECLARATION FOR PATENT APPLICATION AND POWER OF ATTORNEY 

(Includes Reference to PCT International Applications) 



ATTORNEY'S DOCKET NUMBER 

640100-430 
Customer No.: 27162 



As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint inventor (if plural names 
are listed below) of the subject matter which is claimed and for which a patent is sought on the invention entitled: 

Human Mesenchymal DNAs and Expression Products 
the specification of which (check only one item below): 



gpj was filed as United States application 
*PJ Serial No. 09/937,974 
yj on October 1, 2001 

and was amended on (if applicable) 

was filed as PCT international application 
'f° Number PCT/US00/08751 

on 31 March 2000 

and was amended under PCT Article 19 
O on (if applicable). 

I hereby state that I have reviewed and understand the contents of the above-identified specification, including the claims, as amended 
by any amendment referred to above. 

I acknowledge the duty to disclose information which is material to the patentability of this application in accordance with Title 37, Code 
of Federal Regulations §1 .56(a). 

I hereby claim foreign priority benefits under Title 35 United States Code §119 of any foreign application(s) for patent or inventor's 
certificate or of any PCT international application(s) designating at least one country other than the United States of America listed below 
and have also identified below any foreign application(s) for patent or inventor's certificate or any PCT international application(s) 
designating at least one country other than the United States of America filed by me on the same subject matter having a filing date 
before that of the application(s) of which priority is claimed: 



PRIOR FOREIGN/PCT APPLICATION(S) AND ANY PRIORITY CLAIMS UNDER 35 U.S.C. 119: 



COUNTRY 
(if PCT indicate PCT) 


APPLICATION NUMBER 


DATE OF FILING 
(day, month, year) 


PRIORITY CLAIMED 
UNDER 35 USC 119 
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NO 
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NO 
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YES 


□ 


NO 
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COMBINED DECLARATION FOR PATENT APPLICATION AND POWER OF ATTORNEY 
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ATTORNEYS DOCKET NUMBER 

640100^130 
Customer No.: 27162 


1 hereby claim the benefit under Title 35, United States Code, §1 20 or § 1 1 9 of any United States application(s) or PCT international application(s) designating 
the United States of America that is/are listed below and, insofar as the subject matter of each of the claims of this application is not disclosed in that/those prior 
application(s) in the manner provided by the first paragraph of Title 35, United States Code, §112,1 acknowledge the duty to disclose material information as 
defined in Title 37, Code of Federal Regulations, §1 .56(a) which occurred between the filing date of the prior application(s) and the national or PCT international 
filing date of this application: 


PRIOR U.S. APPLICATION(S) OR PCT INTERNATIONAL APPLICATIONS DESIGNATING THE U.S. FOR BENEFIT UNDER 35 

U.S.C. 120 or U.S.C. 119: 


U.S. APPLICATIONS 


STATUS (Check one) 


U.S. APPLICATION NO. 


U.S. FILING DATE 


PATENTED 


PENDING 


ABANDONED 


60/127,418 
60/148,800 


1 April 1999 
1 3 August 1 999 




□ 


□ 


PCT APPLICATIONS DESIGNATING THE U.S. 




□ 


□ 


-PCT APPLICATION NO. 


PCT FILING DATE 


U.S. SERIAL NUMBERS 
ASSIGNED (if any) 




□ 


□ 


a;S PCT/US00/08751 


31 March 2000 








□ 


POWER 

22J846); E 


OF ATTORNEY: As a n 
t and Trademark Office 
Elliot M. Olstein (Reg. No 
_22^280Land Glennon T 


amed inventor, I her 
connected therewith 
^24,025); Raymond 


sby appoint the following attorney(s) and/or ag 
. (List name and registration number) Joh 
J. Lillie (Reg. No. 31 .778): William Squire (Rec 
39,050) 


snt(s) to prosecute this application and transact all business in 
n N. Bain ( Reg. No. 18,651): John G. Giifillan III (Reg. No. 
. No. 25.378): Alan Grant (Reg.HNb. 33,389), Francis C. Hand 


(Reg. No 


oublefield (Reg. No 




Cc 


rrespondence to: Alan J. Grant, Esq. 

Carella, Byrne, Bain, Gilfillan, Cecchi, Stewart & Olstein 




Direct Telephone Calls to: 
(name and telephone number) 
(973)994-1700 


^ R«r-.ker Farm Roa~a, Roselacid. New Jersey 07068 


'pi 


FULL NAME OF 
INVENTOR 


FAMILY NAME 

Van den Bos 




FIRST GIVEN NAME 

Christian 


SECOND GIVEN NAME 






RESIDENCE & 
CITIZENSHIP 


CITY 

Baltimore 


STATE OR FOREIGN COUNTRY 
Maryland . M -j— 

m n 


COUNTRY OF CITIZENSHIP 
Germany 


POST OFFICE 
ADDRESS 


POST OFFICE ADDRESS 
2214 East Fairmont Avenue 


CITY 

Baltimore 


STATE & ZIP CODE/COUNTRY 

Maryland 21231 


CN 
CD 
CM 


FULL NAME OF 
INVENTOR 


FAMILY NAME 
Mbalaviele 




FIRST GIVEN NAM 

Gabriel 


E 


SECOND GIVEN NAME 






RESIDENCE & 
CITIZENSHIP 


CITY 
O'Fallon 


STATE OR FOREIGN COUNTRY 

Missouri f^^tf^^) 


COUNTRY OF CITIZENSHIP 
Republic of Congo 


POST OFFICE 
ADDRESS 


POST OFFICE ADDRESS 

8 Chastain Court 


CITY 
O'Fallon 


STATE & ZIP CODE/COUNTRY 
Missouri, 63366 


co 

CD 
CM 


FULL NAME OF 
INVENTOR 


FAMILY NAME 


FIRST GIVEN NAME 


SECOND GIVEN NAME 


RESIDENCE & 
CITIZENSHIP 


CITY 


STATE OR FOREIGN COUNTRY 


COUNTRY OF CITIZENSHIP 


POST OFFICE 
ADDRESS 


POST OFFICE ADDRESS 


CITY 


STATE & ZIP CODE/COUNTRY 


I hereby 
and furth 
both, unc 
issuing t 


declare that all stateme 
er that these statement 
er section 1001 of Title 
lereon. 


its made herein of my own knowledge are true and that all statements made on information and belief are believed to be true; 
s were made with the knowledge that willful false statements and the like so made are punishable by fine or imprisonment, or 
18 of the United States Code, and that such willful false statements may jeopardize the validity of the application of any patent 
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DATE 
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